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BACKGROUND OF THE INVENTION 



This application claims priority to U.S. Provisional Patent Application No. 
60/437,928 filed on January 3, 2003, U.S. Provisional Patent Application No. 60/446,942 
filed on February 12, 2003, and U.S. Provisional Patent Application No. 60/474,826 filed 
on May 30, 2003, all of which are incorporated by reference in their entireties. 

This invention was made with j ^govemment support under may own rights in 




Institutes of Health. The government has certain rights in the invention. 

1. Field of the Invention 

The present invention relates generally to the fields of molecular genetics, 
pharmacogenetics, and cancer therapy. In particular, the present invention is directed to 
methods and compositions for detecting polymorphisms and correlating the presence or 
absence of certain polymporphisms with toxic effects of chemotherapies. More 
specifically, the present invention is directed to methods and compositions for 
determining the presence or absence of polymorphisms within a uridine diphosphate 
glucuronosyltransferase I Al (UGTlAl) promoter and correlating these polymorphisms 
with toxic effects of irinotecan, as well as evaluating the risk of an individual for 
developing irinotecan toxicity. In some embodiments, the invention concerns methods 
and compositions for predicting or anticipating the level of toxicity caused by irinotecan 
and other compounds glucuronidated by a UGT enzyme in a patient. Such methods and 
compositions can be used to evaluate whether irinotecan-based therapy or therapy 
involving a UGT substrate may pose toxicity problems if given to a particular patient. 
Alterations in suggested therapy may ensue if a toxicity risk is assessed. 

2. Description of Related Art 

Glucuronidation plays a major role in the pharmacological activity and clearance 
of a large variety of compounds (Tukey and Strassburg, 2000). Genetic studies of UDP- 
glucuronosyltransferases (UGTs) aim to characterize an individual's predisposition to 
various diseases and increased risk of adverse outcome to drug treatment. The variation 
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in the UDP-glucuronosyltransferase 1 Al {UGTlAl) gene is the most extensively studied. 
UGTlAl basal expression is affected by the variable number of TA repeats in the TATA 
box, i.e., (TA)n, see U.S. Patent 6,395,481, which is incorporated herein by reference. A 
variable number of repeats (5, 6, 7, and 8) have been found in the UGTlAl TATA box. 
5 Gene transcriptional efficiency has been inversely correlated to the number of TA repeats 
(Beutler et ah, 1998). Thus, a larger TA repeat number is associated with reduced 
transcriptional activity (Beutler et al, 1998) leading to various degrees of impaired 
glucuronidation of UGTlAl substrates. 

Homozygosity for (TA)7 allele is associated with Gilbert's syndrome (a familial 

10 mild hyperbilirubinemia) (Bosma et al, 1995 and Monaghan et al, 1996) and 
predisposition to the toxic effects of cancer treatment with irinotecan (Ando et al, 2000 
and Iyer et al, 2002). Gilbert's syndrome has also been associated with missense coding 
variants in the UGTlAl gene, in particular in Asian populations where these variants are 
relatively common. Increased risk of breast cancer was reported in African-American 

15 women who carried the (TA)7 and (TA)8 alleles (Guillemette et al, 2000). In addition to 
the TATA box, Sugatani et al, (2001) identified a region in the UGTlAl promoter 
approximately 3 kb upstream of the TATA box that regulates UGTlAl inducibility by 
phenobarbital. It is also hypothesized that this phenobarbital-responsive enhancer 
module (PBREM) might be modulated by endogenous factors (Sugatani et al, 2002). 

20 UGTlAl activity is probably the result of PBREM-dependent modulation of TATA box- 
dependent basal expression. 

Polymorphisms in UGTlAl are relevant to the treatment of cancer patients with 
irinotecan. Irinotecan is a topoisomerase I inhibitor that is approved worldwide for the 
treatment of metastatic colorectal cancer. Irinotecan has a well established role as single 

25 agent in 5-fluorouracil-refractory patients (Rougier et al, 1998; Cunningham et al., 
1998), as well as in combination with 5-fluorouracil/leucovorin as a first-line therapy 
(Saltz et al, 2000; Rothenberg et al, 2001). 

Irinotecan hydrolysis by carboxylesterase-2 is responsible for its activation to SN- 
38 (7-ethyl-lO-hydroxycamptothecin), a topoisomerase I inhibitor of much higher 

30 potency than irinotecan. The main inactivating pathway of irinotecan is the 
biotransformation of active SN-38 into inactive SN-38 glucuronide (SN-38G). 
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Interpatient differences in systemic formation of SN-38G have been shown to have clear 
clinical consequences in patients treated with irinotecan. Patients with higher 
glucuronidation of SN-38 are more likely to be protected from the dose limiting toxicity 
of diarrhea in the weekly schedule (Gupta et al., 1994). SN-38 is glucuronidated by 
5 UDP-glucuronosyltransferase 1 Al (UGTlAl) (Iyer et al., 1997). 

Despite its efficacy in treating metastatic colon cancer and its broad spectrum of 
activity in other tumor types, irinotecan treatment is associated with significant toxicity. 
SN-38 is an active metabolite of irinotecan, and SN-38 glucuronidation represents a 
mechanism to protect patients from the toxic effects of irinotecan (Gupta et al, 1994). 

10 Reduced SN-38 glucuronidation is thought to underlie the severe toxicity associated with 
irinotecan treatment in some patients (Gupta et al, 1994). The main severe toxicities of 
irinotecan are delayed diarrhea and myelosuppression. In the early single agent trials, 
grade 3-4 diarrhea occurred in about one third of patients and was dose limiting (Negoro 
et al., 1991; Rothenberg et al., 1993). Its fi-equency varies from study to study and is also 

15 schedule dependent. The frequency of grade 3-4 diarrhea in the three- weekly regimen 
(19%) is significantly lower compared to the weekly schedule (36%, Fuchs et al, 2003). 
In addition to diarrhea, grade 3-4 neutropenia is also a common adverse event, with about 
30-40% of the patients experiencing it in both weekly and three-weekly regimens (Fuchs 
et al, 2003; Vanhoefer et al, 2001). Fatal events during irinotecan treatment have been 

20 reported. A high mortality rate of 5.3% and 1.6% was reported in the weekly and three- 
weekly single agent irinotecan regimens, respectively (Fuchs et al, 2003). 

Although retrospective analysis of UGTlAl genetic variation in relation to severe 
toxicity after different irinotecan-based regimens has been conducted in Japanese patients 
(Ando et al, 2000), prospective evaluation in a large trial has not been performed. 

25 Thus, the problem of identifying the effects of various promoter polymorphism 

combinations on the expression of UGTlAl for the determination of UGT activity levels 
remains. Improved methods and compositions for the evaluation of risk for irinotecan 
toxicity in an individual or patient are still needed. 

30 SUMMARY OF THE INVENTION 
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Metabolism of SN-38, an active metabolite of irinotecan, via glucuronidation 
represents a mechanism to protect patients from the toxic effects of irinotecan, thus a 
reduction in SN-38 glucuronidation contributes to the probability that toxicity associated 
with irinotecan may be experienced in patients. While some genetic basis for reduced 
5 SN-38 glucuronidation have been identified, other basis have yet to be identified. 
Therefore, there remains a need for improved methods and compositions for evaluating 
polymorphisms in one or both UGTlAl genes of a patient and correlating a genotype 
with adverse effects of various therapies. 

The present invention is based on the fact that genetic variation is correlated with 

10 UGTlAl expression and has several important clinical implications. The improved 
methods and compositions of the present invention may be used in determining if a 
treatment has a propensity to adversely affect a patient or what treatment may be 
appropriate or inappropriate for a particular patient. UGTlAl basal transcription is 
affected by a polymorphic (TA) repeat (see FIG. 1 legend in Innocenti et al, 2002), in 

15 addition to a phenobarbital-responsive enhancer module (PBREM) that contains variants 
affecting inducible gene expression, as described herein. A "polymorphism" or "genetic 
polymorphism," as referred to herein, is the existence of two or more variant forms of a 
particular characteristic, e.g., a single nucleotide or a repeat a nucleotide or nucleotides. 
Generally, variations are due to the addition, deletion, or substitution of one or more 

20 nucleotides at a site or a variation in the number of tandem repeats of a DNA sequence. 
In various embodiments, other polymorphisms within or outside the UGTl gene locus, 
see Genbank accession nimiber AF297093 which is incorporated herein by reference, 
may be used as long as an association of a polymorphism with a particular phenotype 
and/or haplotype can be established. Exemplary methods for genotyping a UGTl A gene 

25 may be found at least in U.S. Patents 6,479,236, 6,472, 1 57 and 6,395,48 1 , each of which 
is incorporated herein by reference. 

In various embodiments of the invention, significant linkage disequilibrium 
between a (TA) polymorphism and variants in the PBREM, or other variants within or 
outside the UGTl gene locus, indicates that patients possessing such other variants may 

30 be at risk of irinotecan toxicity. "Significant" as used in respect to linkage 
disequilibrium, as determined by one of skill in the art, is contemplated to be a statistical 
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p or a value that may be 0.25 or 0.1 and may be 0.1, 0.05. 0.001, 0.00001 or less. 
"Linkage disequilibrium" ("LD" as used herein, though also referred to as "LED" in the 
art) refers to a situation where a particular combination of alleles (i.e., a variant form of a 
given gene) or polymorphisms at two loci appears more frequently than would be 
5 expected by chance. The relationship between PBREM-(TA)n haplotypes and the 
glucuronidation rate of the UGTlAl substrate SN-38 may be used to correlate the 
genotype {i.e., the genetic make up of an organism) to a phentoype {i.e., the physical 
traits displayed by an organism or cell). "Haplotype" is used herein to refer to a 
collective genotype of two or more closely linked loci. Each haplotype defines the 
10 sequence of alleles or polymorphisms along one of the homologous chromosomes. In 
some embodiments, the polymorphisms may be 0.001, 0.01, 0.1, 0.2 cM or more within 
one another. 

Various embodiments of the invention include methods for evaluating the risk of 
toxicity fi-om irinotecan, or other UGTlAl substrates, in a patient. A polymorphism may 

15 be a single nucleotide polymorphism (SNP) and may be in linkage disequilibrium with a 
(TA)n repeat. In certain embodiments, the methods include detecting one or more 
polymorphisms in one or both copies of the UGTlAl gene and/or one or both copies of 
any other gene located in the UGTl gene locus of a patient. In particular embodiments a 
promoter polymorphism is detected. It is specifically contemplated that methods and 

20 compositions of the invention may be implemented to determine whether UGTlAl 
polymorphisms are present or absent in one or both alleles. 

In certain embodiments, a polymorphism may be a polymorphism that affects the 
transcription of UGTlAl, such as in the promoter region or 5' flanking region that affects 
transcription (which includes the promoter region), and in particular a polymorphism at 

25 nucleotide position -3440, -3401, -3279, -3177, -3175, or -3156 fi-om the UGTlAl gene 
transcriptional start site, which is designated +1 with no nucleotide designated as 0. The 
number of TA repeats can be 5, 6, 7, 8 or more TA repeats. In particular embodiments, 
the polymorphism is the following: -34400 A, -340 1 T>C, -32790T, -3 1 770G, - 
3175A>G, -31560A, or any combination thereof The notation -3440OA, for example 

30 indicates that cytosine nucleotide (C) at the -3440 position is replaced by an Adenosine 
(A). 
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Methods of the invention may include obtaining a nucleic acid sample from a 
patient and detecting one or more polymorphisms in the UGTlAl gene using various 
methods. In certain embodiments, polymorphism detection may include amplifying a 
nucleic acid containing all or part of a particular region of the UGTlAl gene to obtain 
5 amplification products; and/or analyzing the amplification products for the presence or 
absence of one or more polymorphisms. Other methods of polymorphism detection 
known in the art are also contemplated. 

In certain embodiments, a promoter polymorphism of a UGTlAl gene may be 
detected by performing one of a variety of known assays. These may include, but are not 

10 limited to hybridization assays, sequencing or microsequencing assays, allele-specific 
amplification assays or any other methods known for detecting nucleic acid 
polymorphisms, which may or may not include amplification of a nucleic acid. It is 
understood that "detecting" a polymorphism includes identifying the nucleotide sequence 
at that site and/or determining whether the polymorphism is present or absent. 

15 A correlation between one or more polymorphisms and the glucoronidation rate 

of irinotecan or other substrates of UGTlAl, including but not limited to bilirubin, 
estriol, beta-estradiol, 2-hydroxyestriol, 2-hydroxyestrone, 2-hydroxyestradiol, thyroxine 
(T4), rT3, octyl gallate, propyl gallate, anthraflavic acid, quercitin, fisetin, naringenin, 1- 
naphtol, and ethynylestradiol, may be used to determine various aspects of a treatment 

20 regime, including irinotecan and/or other drugs or compoxmds metabolized directly or 
indirectly by UGTlAl. In some embodiments the methods also include analyzing the 
glucuronidation rate associated with the various polymorphisms and polymorphism 
combinations, for exemplary methods and compositions related to analysis of 
glucuronidation rates see U.S. Patent 6,319,678, which is incorporated herein by 

25 reference. The methods may also include determining the biliary transport capacity of 
the patient. In particular embodiments the evaluation of the promoter polymorphism may 
be used to optimize the dose of irinotecan or other compounds for treatment of a patient 
or to reduce their toxicity. 

The methods of the invention may further include treating a patient by 

30 administering to the patient irinotecan in combination with other pharmaceutical agents at 
appropriate dosages, such that the toxicity of irinotecan or other substrates of UGTlAl 
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are reduced. In particular embodiments, a second agent that reduces excretion of an 
active irinotecan species through the bile may administered in conjunction with 
irinotecan based upon determinations made using methods and compositions of the 
invention, for related methods and compositions see U.S. Patents 6,407,117, 6,287,834 
5 and 5,786,344, each of which is incorporated herein by reference. 

The present invention is also based on the observation that the nucleotide at 
position [[-3516]] -3156 in the UGTlAl upstream region is correlated with irinotecan 
toxicity. An A at that position positively correlates with irinotecan toxicity while a G at 
that position correlates with tolerance to irinotecan. Thus, the present invention concerns 

10 methods and compositions for evaluating, predicting, and determining whether a patient 
will experience toxicity from irinotecan. Toxicity from irinotecan evidences itself as side 
effects from the drug, which are well known to oncologists and their patients. 

In some embodiments of the invention, there are methods of predicting whether a 
patient may suffer or be subject to toxicity from irinotecan if given it involving 

15 determining the nucleic acid sequence of base [[-3516]] -3156 in the UGTlAl 5' 
flanking region in one or both alleles of the patient. The presence of an A nucleotide 
indicates the person is at risk for irinotecan toxicity. An AA genotype is more closely 
correlated with grade 4 neutropenia than other genotypes at that position. Moreover, in 
some embodiments, this is unrelated to the genotype of the TA indel in the UGTlAl 

20 promoter. It is contemplated that these methods concerning the indel at position [[- 
3516]] -3156 in the UGTlAl 5' flanking region can be implemented with methods 
involving determining one or more other polymorphisms in the UGTlAl 5' flanking 
region of the same patient. 

Consequently, if a person is identified as at risk for irinotecan toxicity based on 

25 any of the embodiments discussed herein, an alternative course of therapy or a lower dose 
of irinotecan than is normally given may be contemplated. In addition, methods also 
include determining the sequence of other polymorphisms or indels (insertion/deletions) 
in linkage disequilibrium (LD) with the [[-3516]] -3156 variant. Therefore, in some 
embodiments of the invention, the TA indel is evaluated to determine the number of 

30 repeats. Also, any other variant in UGTlAl or any other gene (the term "gene" includes 
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non-coding regions that affect the expression or activity level of the encoded polypeptide) 
may be evaluated for variants in LD with the [[-3516]] -3156 variant. 

Various embodiments may include a kit for evaluating the risk of irinotecan 
toxicity in a patient. The kit may include a variety of containers, reagents and the like. 
5 In certain embodiments, the kit may include an oligonucleotide primer to amplify a 
promoter region of a UGTlAl gene or genes, haplotype tag SNPs or allele specific 
amplification primers of the UGTlAl gene or any other primer within the UGTl gene 
locus. The haplotype tag SNPs or allele specific primers may be used to amplify a 
polymorphism at one or more nucleotide positions of the UGTlAl gene or other UGTl 

10 locus gene. In particular embodiments, the nucleotide position may be at -3440, -3401, - 
3279, -3177, -3175, or -3156, or a combination thereof, from the UGTlAl gene 
transcriptional start site. The kit may include the haplotype tag SNPs or allele specific 
amplification primers in multi-well assay plate. The kit may also include haplotype tag 
SNPs or allele specific hybridization probes for a variety of promoter polymorphisms. 

15 The haplotype tag SNPs or allele specific hybridization probes may detect 
polymorphisms at nucleotide position -3440, -3401, -3279, -3177, -3175, or -3156 from 
the UGTlAl gene transcriptional start site. The kit may include haplotype tag SNPs or 
allele specific hybridization probes comprised in an oligonucleotide array or microarray. 
Compositions of the invention include nucleic acids that can be used to determine 

20 the sequence at position [[-3516]] -3156 of UGTIA or other reagents in that regard. 
Arrays and other assays for screening multiple samples are also included as part of the 
invention. Such compositions may be incorporated into kits or as part of a kit, along with 
any other composition discussed herein. 

It is contemplated that any method or composition described herein can be 

25 implemented with respect to any other method or composition described herein. 
Similarly, any embodiment discussed with respect to one aspect of the invention may be 
used in the context of any other aspect of the invention. 

Throughout this application, the term "about" is used to indicate that a value 
includes the standard deviation of error for the device or method being employed to 

30 determine the value. 
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The use of the word "a" or "an" when used in conjunction with the term 
"comprising" in the claims and/or the specification may mean "one," but it is also 
consistent with the meaning of "one or more," "at least one," and "one or more than one." 

Other objects, features and advantages of the present invention will become 
5 apparent from the following detailed description. It should be understood, however, that 
the detailed description and the specific examples, while indicating specific embodiments 
of the invention, are given by way of illustration only, since various changes and 
modifications within the spirit and scope of the invention will become apparent to those 
skilled in the art from this detailed description. 

10 

BRIEF DESCRIPTION OF THE DRAWINGS 

The following drawings form part of the present specification and are included to 
fiarther demonstrate certain aspects of the present invention. The invention may be better 
15 understood by reference to one or more of these drawings in combination with the 
detailed description of specific embodiments presented herein. 

FIG. 1 illustrates an exemplary phenobarbital-responsive enhancer module 
(PBREM) and description of polymorphisms VSEO ID NO: MX The previously described 

20 PBREM domains are underlined with the NR half-site sequences shown in bold. The 
polymorphic sites of the present application are included. The variants found at these 
sites are also listed. Positions indicated are from the first base of the UGTIA cluster 
sequence (Genbank accession No. AF297093) (SEO ID NO:!) . 

FIG, 2A-2D illustrates (TA)n genotype-phenotype relationship in human livers. 

25 (a) correlation in all samples investigated (n=83) (b) correlation in Caucasians (n=56) (c) 
correlation in African-Americans (n=15) (d) correlation in individuals of Asian (n=l), 
and unknown (n=10) ethnicity. Liver microsomes were phenotyped for SN-38 
glucvironidation rates in each liver with a single experiment performed in triplicate. Bars 
show the mean value of SN-38 glucuronidation rates in each group. 
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FIG. 3 illustrates an exemplary haplotype-phenotype relationship in human livers 
of Caucasian and African origin (n=70). Bars show the mean value of SN-38 
glucuronidation rates in each group. Only haplotypes with > 2 samples are shown. 

FIG. 4. Correlation between ANC and TA indel genotype. Bars represent the 
5 means. Nonparametric trend analysis (7/7<6/7<6/6, z = -2.72, p=0.01). 

FIG. 5. Pre-treatment total bilirubin levels and distribution of the -3156 
genotypes within each TA indel genotype. The -3156 AA genotypes are reported in 
squares, the GA genotypes in circles and the GO genotypes in triangles. Bars represent 
the mean values. A significant trend was reported (7/7>6/7>6/6, z=2.88, p<0.01, 
10 nonparametric trend analysis). 

FIG. 6. Correlation between ln(ANC nadir) and pretreatment total bilirubin 
levels. Patients with bilirubin levels less than 0.6 mg/dl are depicted in squares. Those 
with bilirubin levels higher than 0.7 mg/dl are depicted in circles. 

15 DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

The present invention provides improved methods and compositions for 
identifying the effects of various polymorphisms, promoter polymorphisms or any 
combination thereof on the expression of UGTlAl or the glucuronidation rate of 
UGTl Al for the evaluation of the potential or risk for irinotecan toxicity in an individual 

20 or patient. The development of these improved methods and compositions allows for the 
use of such an evaluation to optimize treatment of a patient and to lower the risk of 
toxicity. In certain aspects of the invention various combinations of promoter 
polymorphisms may be used in this evaluation, in particular, polymorphisms in the 
PBREM region and polymorphisms in the TA repeats may be used. 

25 Genetic variation in UGTlAl expression has several important clinical 

implications. UGTlAl basal transcription is affected by a polymorphic (TA) repeat. 
Another important regulatory element is the phenobarbital-responsive enhancer module 
(PBREM) which may contain variants affecting inducible gene expression. The 
examples provided herein study the extent of linkage disequilibrium between the (TA) 

30 polymorphism and variants in the PBREM and UGTlAl promoter. The relationship 
between PBREM-(TA)n haplotypes and the glucuronidation rate of the UGTlAl substrate 
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SN-38 is also addressed herein. Studies described in the Examples illustrate that SN-38G 
formation rates were correlated with (TA) genotypes and promoter variants. In vaious 
aspects particular (TA) variants are in linkage disequilibrium with various other 

polymorphisms. 

5 Certain aspects of the invention are based on, but not limited to, the observation 

and characterization of novel polymorphisms in the PBREM region of the UGTlAl gene. 
Due to the clinical implications of genetically modified regulation of UGTlAl activity, 
the PBREM region was sequenced and polymorphisms in the TATA box of the UGTlAl 
promoter genotyped, as described in the Examples section below. 

10 

I. HEPATIC GLUCURONIDATION BY UGT ENZYMES 

Hepatic glucuronidation results from the activities of a multigene family of UGT 
enzymes, the members of which exhibit specificity for a variety of endogenous substrates 
and xenobiotics. The UGT enzymes are broadly classified into two distinct gene 

15 families. The UGTl locus codes for multiple isoforms of UGT, all of which share a C- 
terminus encoded by a imique set of exons 2-5, but which have a variable N-terminus 
encoded by different first exons, each with its own independent promoter (Bosma et al, 
1992; Ritter et al., 1992). The variable first exons confer the substrate specificity of the 
enzyme. Isoforms of the UGT2 family are unique gene products of which at least eight 

20 isozymes have been identified (Clarke et al. Handbook of Experimental Pharmacology 
1994). The UGTlAl isoform is the major bilirubin glucuronidation enzyme. Genetic 
defects in the UGTlAl gene can result in decreased glucuronidation activity which leads 
to abnormally high levels of unconjugated serum bilirubin that may enter the brain and 
cause encephalopathy and kemicterus (Owens & Ritter, 1995). This condition is 

25 commonly known as Gilbert's syndrome. The molecular defect in Gilbert's Syndrome is 
a change in the TATA box within the UGTlAl promoter (Bosma et al., 1995 and 
Monaghan et al., 1996). This promoter usually contains a (TA)6 TAA element, but 
another allele, termed UGTlAl *28 or allele 7, is also present in human populations at 
high fi-equencies, and contains the sequence (TA)^ TAA. This polymorphism in the 

30 promoter of the UGTlAl gene results in reduced expression of the gene and accounts for 
most cases of Gilbert's Syndrome (Bosma et al., 1995). Overall, gene expression levels 
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for the UGTlAl promoter alleles are inversely related to the length of the TA repeat in 
the TATA box. 

The variation observed in this promoter may also account for the inter-individual 
and inter-ethnic variation in drug metabolism and response to xenobiotic exposure. 
5 UGTs have been shown to contribute to the detoxification and elimination of both 
exogenous and endogenous compounds. For example, one typical role of the UGTlAl 
isoform is the glucuronidation of SN-38 (7-ethyl-lO-hydroxycamptothecin) to the 
corresponding glucuronide (lO-O-glucuronyl-SN-38, SN-38G) as well as the 
glucuronidation of TAS-103 (6-[[2-(dimethylamino)ethyl]amino]-3-hydroxy-7H- 

10 indeno[2,l-c]quinoline-7-one dihydrochloride) to its corresponding glucuronide (TAS- 
103G). SN-38 is the active form of irinotecan (CPT-11, 7-ethyl-10-[4-(l-piperidino)-l- 
piperidino]carbonyloxycamptothecin), which is a camptothecin derivative used in the 
treatment of metastatic colorectal cancer and other malignancies. The metabolism of SN- 
38 and TAS-103 (also known as flavopiridol) are merely illustrative of the present 

15 invention the metabolism of other UGTlAl substrates is also contemplated, such as 
estradiol, bilirubin, simple phenols, flavones, CI 8 steroids, complex phenols and 
coumarins. 

Irinotecan is biotransformed by tissue and serum carboxylesterases to an active 
metabolite, SN-38, which has a 100-1,000-fold higher antitumor activity than irinotecan. 

20 SN-38 is glucuronidated by hepatic uridine diphosphate glucuronosyltransferases (UGTs) 
to form SN-38 glucuronide (lOO-glucuronyl-SN-38, SN-38G), which is inactive and 
excreted into the bile and urine although, SN-38G might be deconjugated to form SN-38 
by intestinal p -glucuronidase enzyme (Kaneda et al, 1990). 

The major dose-limiting toxicities of irinotecan include diarrhea and, to a lesser 

25 extent, myelosuppression. irinotecan-induced diarrhea can be serious and often does not 
respond adeqmtely to conventional antidiarrheal agents (Takasuna et al., 1995). This 
diarrhea may be due to direct enteric injury caused by the active metabolite, SN-38, 
which has been shown to accvimulate in the intestine after intra peritoneal administration 
of irinotecan in athymic mice (Araki et al, 1993). The resuhs of a recently completed 

30 phase I clinical trial, demonstrated that there was an inverse relationship between SN-38 
glucuronidation rates and severity of diarrheal incidences in patients treated with 
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increasing doses of Irinotecan (Gupta et al, 1994). These findings indicate that 
glucuronidation of SN-38 protects against Irinotecan-induced gastrointestinal toxicity. A 
complete discussion of the correlation between diarrhea and SN-38 glucuronidation, as 
well as a description of biochemical methods for determining glucurondation levels can 
5 be found in US Patent 5,786,344 and W096/01 127 which are both incorporated herein by 
reference in their entirety. Likewise, the results of studies using TAS-103 demonstrate 
that glucuronidation of TAS-103 may protect against TAS-103 induced toxicity. 
Therefore, the conversion of these two toxic compounds by hepatic UGTs demonstrates 
the importance of monitoring glucuronidation activity as an indicator of susceptibility to 

10 toxicity caused by exposure to compounds that are metabolized by UGTs. Furthermore, 
differential rates of SN-38 glucuronidation among subjects may explain the considerable 
inter-individual variation in the pharmacokinetic parameter estimates and toxicities 
observed after treatment with anti-cancer drugs or exposure to xenobiotics (Gupta et al, 
1994; Gupta a/., 1997). 

15 When two species, Gunn rats (Gvmn, 1938) and CN-1 patients, that are deficient 

in UGTIA isoforms were screened for TAS-103 and SN-38 glucuronidation activity, 
there was approximately an 80% lower glucuronidation rate of TAS-103 in vitro and no 
in vitro glucuronidation of SN-38 compared to healthy liver donors. These results 
demonstrate the role of the UGT 1 family in catalyzing SN-38 and TAS-103 conjugation. 

20 Furthermore, these results demonstrate that the UGT2 family does not play a role in the 
glucuronidation of SN-38. On the other hand, while isoforms of the UGTl family are the 
predominate isoforms involved in TAS-103 glucuronidation, the isoform of the UGT2 
family may also participate in TAS-103 glucuronidation. Failure to glucuronidate SN-38 
and TAS-103 in these instances may result specifically fi-om the genetic defect in UGTl 

25 gene family. 

Other experiments confirm the association between the UGTl A 1 isoform and SN- 
38 and TAS-103 glucuronidation. These studies show that substantial genetic variability 
exists in the UGTl A 1 isoform family and particularly in the UGTl A 1 promoter. This 

genetic variability has been shown to correlate with gene expression. For example, the 
30 presence of the 5 allele in the UGTlAl promoter leads to increased gene expression while 
the presence of the 8 allele leads to reduced gene expression. Differences in gene 
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expression levels may give rise to individuals with varying abilities to glucuronidate 
compounds metabolized by UGTs. This prediction was confirmed through a correlation 
analysis of UGTlAl promoter genotype and rate of in vitro SN-38 and TAS-103 
glucuronidation. 

5 It follows therefore that individuals with the 8 allele may also have differing 

susceptibility to xenobiotics when compared to other genotypes when those compounds 
are metabolized by UGTlAls. On the other hand, the presence of the 5 allele that 
correlates with increased gene expression and higher glucuronidation activity may result 
in the administration of less than optimum drug dosages. For example, when a drug 

10 metabolized by UGTlAls is administered to an individual with this polymorphism, the 
increased glucuronidation activity may cause more of the drug to be converted into the 
inactive metabolite in a shorter period, thereby reducing the drug's effectiveness. 
Conversely, in the rare case of drugs and xenobiotics that require glucuronidation for 
activation, decreased glucuronidation activity may cause less of the activated form of the 

15 drug or xenobiotic to be available. 

The fact that repeated sequences are intrinsically unstable and tend to lengthen 
and shorten as a result of unequal crossing-over during meiosis may explain the presence 
of other alleles, in addition to (TA)6 and (TA)?, in the population. Two additional alleles 
have been identified in hiraian populations: allele 5, containing the sequence (TA)5 TAA 

20 and allele 8, containing the sequence (TA)8 TAA, see U.S. Patent 6,395,481, which is 
incorporated in its entirety by reference. Interestingly, alleles 5 and 8 were found 
predominantly in population samples from Sub-Saharan Africa, where they occur at 
lower frequencies than the common alleles 6 and 7 although it is possible that these two 
alleles are present across a variety of ethnic groups. The frequency of alleles 6 and 7 also 

25 appears to differ significantly across ethnic groups, with Asian and Amerindian 
populations showing the highest frequencies of allele 6. Conversely, alleles 6 and 7 occur 
at intermediate and similar frequencies among Caucasians and Sub-Saharan Africans. 

Several hypotheses may be proposed regarding the selective pressures that might 
be responsible for the observed pattern of inter-population variation at the UGTlAl 

30 promoter. It was previously proposed that the maintenance of intermediate levels of 
bilirubin is adaptive (Beutler et al, 1998), and that the alleles at this promoter would be 

55142555.1 

-15- 



maintained in the population by balancing selection. This hypothesis is based on the 
observation that bilirubin is a potent antioxidant likely to have physiological significance 
(Stocker et al, 1987). Hov/ever, it is also known that glucuronidation is an important 
detoxification step for many endogenous as well as exogenous compounds (Clarke & 
5 Burchell, 1994). In addition to TAS-103 and SN-38, UGTlAl is likely to act on other 
substrates present in the environment, e.g., dietary components, environmental pollutants 
and carcinogens, which require detoxification as well as playing a role in the metabolism 
of bilirubin and other endogenous compounds. Within this framework, maintaining high 
levels of UGTlAl gene expression would ensure rapid elimination of toxic or 

10 endogenous compounds and be advantageous. 

As described herein, the correlation between in vitro glucuronidation rate and 
UGTlAl promoter polymorphism found for alleles 6 and 7 has been shown to extend to 
alleles 5 and 8. Because these alleles appear to be more frequent in subsets of human 
populations (for example, those of African origin), an even higher inter-individual 

15 variability in SN-38 and TAS-103 metabolism might be expected within these 
populations. Because the inverse relationship between TA repeat size and rate of SN-38 
glucuronidation extends to alleles 5 and 8, a screening assay that identifies these alleles 
can facilitate individualization of drug therapy, identify individuals susceptible to 
xenobiotic exposure, and can be used to improve drug dosage calculations. 

20 

II. NUCLEIC ACIDS 

Certain embodiments of the present invention concern various nucleic acids, 
including promoters, amplification primers, oligonucleotide probes and other nucleic acid 
elements involved in the analysis of genomic DNA. In certain aspects, a nucleic acid 

25 comprises a wild-type, a mutant, or a polymorphic nucleic acid. 

The term "nucleic acid" is well known in the art. A "nucleic acid" as used herein 
will generally refer to a molecule {i.e., a strand) of DNA, RNA or a derivative or analog 
thereof, comprising a nucleobase. A nucleobase includes, for example, a naturally 
occurring purine or pyrimidine base found in DNA {e.g., an adenine "A," a guanine "G," 

30 a thymine "T" or a cytosine "C") or RNA {e.g., an A, a G, an uracil "U" or a C). The 
term "nucleic acid" encompass the terms "oligonucleotide" and "polynucleotide," each as 
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a subgenus of the terai "nucleic acid." The term "oligonucleotide" refers to a molecule of 
between about 3 and about 100 nucleobases in length. The term "polynucleotide" refers 
to at least one molecule of greater than about 100 nucleobases in length. A "gene" refers 
to coding sequence of a gene product, as well as introns and the promoter of the gene 
5 product. In addition to the UGTlAl gene, other regulatory regions such as enhancers for 
UGTlAl are contemplated as nucleic acids for use with compositions and methods of the 
claimed invention. 

These definitions generally refer to a single-stranded molecule, but in specific 
embodiments will also encompass an additional strand that is partially, substantially or 

10 fully complementary to the single-stranded molecule. Thus, a nucleic acid may 
encompass a double-stranded molecule or a triple-stranded molecule that comprises one 
or more complementary strand(s) or "complement(s)" of a particular sequence 
comprising a molecule. As used herein, a single stranded nucleic acid may be denoted by 
the prefix "ss", a double stranded nucleic acid by the prefix "ds", and a triple stranded 

1 5 nucleic acid by the prefix "ts." 

In particular aspects, a nucleic acid encodes a protein, polypeptide, or peptide. In 
certain embodiments, the present invention concerns novel compositions comprising at 
least one proteinaceous molecvde. As used herein, a "proteinaceous molecule," 
"proteinaceous composition," "proteinaceous compound," "proteinaceous chain," or 

20 "proteinaceous material" generally refers, but is not limited to, a protein of greater than 
about 200 amino acids or the fiiU length endogenous sequence translated from a gene; a 
polypeptide of greater than about 100 amino acids; and/or a peptide of from about 3 to 
about 100 amino acids. All the "proteinaceous" terms described above may be used 
interchangeably herein. 

25 1. Preparation of Nucleic Acids 

A nucleic acid may be made by any technique known to one of ordinary skill in 
the art, such as for example, chemical synthesis, enzymatic production or biological 
production. Non-limiting examples of a synthetic nucleic acid (e.g., a synthetic 
oligonucleotide), include a nucleic acid made by in vitro chemical synthesis using 

30 phosphotriester, phosphite or phosphoramidite chemistry and solid phase techniques such 
as described in Eiiropean Patent 266,032, incorporated herein by reference, or via 
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deoxynucleoside H-phosphonate intermediates as described by Froehlere^ al, 1986 and 
U.S. Patent 5,705,629, each incorporated herein by reference. In the methods of the 
present invention, one or more oligonucleotide may be used. Various different 
mechanisms of oligonucleotide synthesis have been disclosed in for example, U.S. 
5 Patents 4,659,774, 4,816,571, 5,141,813, 5,264,566, 4,959,463, 5,428,148, 5,554,744, 
5,574,146, 5,602,244, each of which is incorporated herein by reference. 

A non-limiting example of an enzymatically produced nucleic acid include one 
produced by enzymes in amplification reactions such as PCR^m (see for example, U.S. 
Patent 4,683,202 and U.S. Patent 4,682,195, each incorporated herein by reference), or 

10 the synthesis of an oligonucleotide described in U.S. Patent 5,645,897, incorporated 
herein by reference. A non-limiting example of a biologically produced nucleic acid 
includes a recombinant nucleic acid produced {i.e., replicated) in a living cell, such as a 
recombinant DNA vector replicated in bacteria (see for example, Sambrook etal. 2001, 
incorporated herein by reference). 

15 2. Purification of Nucleic Acids 

A nucleic acid may be purified on polyacrylamide gels, cesium chloride 
centrifugation gradients, chromatography columns or by any other means known to one 
of ordinary skill in the art (see for example, Sambrook etal, 2001, incorporated herein 
by reference). In some aspects, a nucleic acid is a pharmacologically acceptable nucleic 

20 acid. Pharmacologically acceptable compositions are known to those of skill in the art, 
and are described herein. 

In certain aspects, the present invention concerns a nucleic acid that is an isolated 
nucleic acid. As used herein, the term "isolated nucleic acid" refers to a nucleic acid 
molecule {e.g., an RNA or DNA molecule) that has been isolated free of, or is otherwise 

25 free of, the bulk of the total genomic and transcribed nucleic acids of one or more cells. 
In certain embodiments, "isolated nucleic acid" refers to a nucleic acid that has been 
isolated free of, or is otherwise free of, bulk of cellular components or in vitro reaction 
components such as for example, macromolecules such as lipids or proteins, small 
biological molecules, and the like. 

30 3. Nucleic Acid Segments 
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In certain embodiments, the nucleic acid is a nucleic acid segment. As used 
herein, the term "nucleic acid segment," are fragments of a nucleic acid, such as, for a 
non-limiting example, those that encode only part of a UGTl gene locus or a UGTlAl 
gene sequence. Thus, a "nucleic acid segment" may comprise any part of a gene 
5 sequence, including from about 2 nucleotides to the frill length gene including promoter 
regions to the polyadenylation signal and any length that includes all the coding region. 

Various nucleic acid segments may be designed based on a particular nucleic acid 
sequence, and may be of any length. By assigning numeric values to a sequence, for 
example, the first residue is 1, the second residue is 2, etc., an algorithm defining all 
10 nucleic acid segments can be created: 

n to n + y 

where n is an integer from 1 to the last number of the sequence and y is the length of the 
nucleic acid segment minus one, where n + y does not exceed the last number of the 
sequence. Thus, for a lO-mer, the nucleic acid segments correspond to bases 1 to 10, 2 to 

15 11, 3 to 12 ... and so on. For a 15-mer, the nucleic acid segments correspond to bases 1 to 
15, 2 to 16, 3 to 17 ... and so on. For a 20-mer, the nucleic segments correspond to bases 
1 to 20, 2 to 21, 3 to 22 ... and so on. In certain embodiments, the nucleic acid segment 
may be a probe or primer. As used herein, a "probe" generally refers to a nucleic acid 
used in a detection method or composition. As used herein, a "primer" generally refers to 

20 a nucleic acid used in an extension or amplification method or composition. 
4. Nucleic Acid Complements 

The present invention also encompasses a nucleic acid that is complementary to a 
nucleic acid. A nucleic acid is "complement(s)" or is "complementary" to another 
nucleic acid when it is capable of base-pairing with another nucleic acid according to the 
25 standard Watson-Crick, Hoogsteen or reverse Hoogsteen binding complementarity rules. 
As used herein "another nucleic acid" may refer to a separate molecule or a spatial 
separated sequence of the same molecule. In preferred embodiments, a complement is a 
hybridization probe or amplification primer for the detection of a nucleic acid 
polymorphism. 

30 As used herein, the term "complementary" or "complement" also refers to a 

nucleic acid comprising a sequence of consecutive nucleobases or semiconsecutive 
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nucleobases (e.g., one or more nucleobase moieties are not present in the molecule) 
capable of hybridizing to another nucleic acid strand or duplex even if less than all the 
nucleobases do not base pair with a counterpart nucleobase. However, in some 
diagnostic or detection embodiments, completely complementary nucleic acids are 
5 preferred. 

III. NUCLEIC ACID DETECTION 

Some embodiments of the invention concern identifying polymorphisms in 
UGTlAl, correlating genotype or haplotype to phenotype, wherein the phenotype is 

10 lowered or altered UGTlAl activity or expression, and then identifying such 
polymorphisms in patients who have or will be given irinotecan or related drugs or 
compounds. Thus, the present invention involves assays for identifying polymorphisms 
and other nucleic acid detection methods. Nucleic acids, therefore, have utility as probes 
or primers for embodiments involving nucleic acid hybridization. They may be used in 

15 diagnostic or screening methods of the present invention. Detection of nucleic acids 
encoding UGTlAl, as well as nucleic acids involved in the expression or stability of 
UGTlAl polypeptides or transcripts, are encompassed by the invention. General 
methods of nucleic acid detection methods are provided below, followed by specific 
examples employed for the identification of polymorphisms, including single nucleotide 

20 polymorphisms (SNPs). 

A. Hybridization 

The use of a probe or primer of between 3, 4, 5, 6, 7, 8,9, 10, 11, 12, 13, 14, or 15 
and 50, 60, 70, 80, 90, or 100 nucleotides, preferably between 17 and 100 nucleotides in 
length, or in some aspects of the invention up to 1-2 kilobases or more in length, allows 

25 the formation of a duplex molecule that is both stable and selective. Molecules having 
complementary sequences over contiguous stretches greater than 20 bases in length are 
generally preferred, to increase stability and/or selectivity of the hybrid molecules 
obtained. One will generally prefer to design nucleic acid molecules for hybridization 
having one or more complementary sequences of 20 to 30 nucleotides, or even longer 

30 where desired. Such fragments may be readily prepared, for example, by directly 
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synthesizing the fragment by chemical means or by introducing selected sequences into 
recombinant vectors for recombinant production. 

Accordingly, the nucleotide sequences of the invention may be used for their 
ability to selectively form duplex molecules with complementary stretches of DNAs 
5 and/or RNAs or to provide primers for amplification of DNA or RNA from samples. 
Depending on the application envisioned, one would desire to employ varying conditions 
of hybridization to achieve varying degrees of selectivity of the probe or primers for the 
target sequence. 

For applications requiring high selectivity, one will typically desire to employ 

10 relatively high stringency conditions to form the hybrids. For example, relatively low 
sah and/or high temperature conditions, such as provided by about 0.02 M to about 0.10 
M NaCl at temperatures of about 50°C to about 70°C. Such high stringency conditions 
tolerate little, if any, mismatch between the probe or primers and the template or target 
strand and would be particularly suitable for isolating specific genes or for detecting a 

15 specific polymorphism. It is generally appreciated that conditions can be rendered more 
stringent by the addition of increasing amounts of formamide. For example, under highly 
stringent conditions, hybridization to filter-bound DNA may be carried out in 0.5 M 
NaHP04, 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65°C, and washing in 0.1 x 
SSC/0.1% SDS at 68°C (Ausubel et al, 1989). 

20 Conditions may be rendered less stringent by increasing salt concentration and/or 

decreasing temperature. For example, a medium stringency condition could be provided 
by about 0.1 to 0.25M NaCl at temperatures of about 37°C to about 55°C, while a low 
stringency condition could be provided by about 0.1 5M to about 0.9M salt, at 
temperatures ranging from about 20°C to about 55°C. Under low stringent conditions, 

25 such as moderately stringent conditions the washing may be carried out for example in 
0.2 X SSC/0.1% SDS at 42°C (Ausubel et al, 1989). Hybridization conditions can be 
readily manipulated depending on the desired results. 

In other embodiments, hybridization may be achieved under conditions of, for 
example, 50mM Tris-HCl (pH 8.3), 75mM KCl, 3mM MgClj, l.OmM dithiothreitol, at 

30 temperatures between approximately 20°C to about 37°C. Other hybridization conditions 
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utilized could include approximately lOmM Tris-HCl (pH 8.3), 50mM KCl, 1.5mM 
MgCl2, at temperatures ranging from approximately 40°C to about 72°C. 

In certain embodiments, it will be advantageous to employ nucleic acids of 
defined sequences of the present invention in combination with an appropriate means, 
5 such as a label, for determining hybridization. A wide variety of appropriate indicator 
means are known in the art, including fluorescent, radioactive, enzymatic or other 
ligands, such as avidin/biotin, which are capable of being detected. In preferred 
embodiments, one may desire to employ a fluorescent label or an enzyme tag such as 
urease, alkaline phosphatase or peroxidase, instead of radioactive or other 

10 environmentally undesirable reagents. In the case of enzyme tags, colorimetric indicator 
substrates are known that can be employed to provide a detection means that is visibly or 
spectrophotometrically detectable, to identify specific hybridization with complementary 
nucleic acid containing samples. In other aspects, a particular nuclease cleavage site may 
be present and detection of a particular nucleotide sequence can be determined by the 

15 presence or absence of nucleic acid cleavage. 

In general, it is envisioned that the probes or primers described herein will be 
useful as reagents in solution hybridization, as in PGR, for detection of expression or 
genotype of corresponding genes, as well as in embodiments employing a solid phase. In 
embodiments involving a solid phase, the test DNA (or RNA) is adsorbed or otherwise 

20 affixed to a selected matrix or surface. This fixed, single-stranded nucleic acid is then 
subjected to hybridization vnth. selected probes under desired conditions. The conditions 
selected will depend on the particular circumstances (depending, for example, on the 
G+C content, type of target nucleic acid, source of nucleic acid, size of hybridization 
probe, etc.). Optimization of hybridization conditions for the particular application of 

25 interest is well known to those of skill in the art. After washing of the hybridized 
molecules to remove non-specifically bound probe molecules, hybridization is detected, 
and/or quantified, by determining the amount of bound label. Representative solid phase 
hybridization methods are disclosed in U.S. Patents 5,843,663, 5,900,481 and 5,919,626. 
Other methods of hybridization that may be used in the practice of the present invention 

30 are disclosed in U.S. Patents 5,849,481, 5,849,486 and 5,851,772. The relevant portions 
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of these and other references identified in this section of the Specification are 
incorporated herein by reference. 

B. Amplification of Nucleic Acids 

Nucleic acids used as a template for amplification may be isolated from cells, 
5 tissues or other samples according to standard methodologies (Sambrook et ah, 2001). In 
certain embodiments, analysis is performed on whole cell or tissue homogenates or 
biological fluid samples with or without substantial purification of the template nucleic 
acid. The nucleic acid may be genomic DNA or fractionated or whole cell RNA. Where 
RNA is used, it may be desired to first convert the RNA to a complementary DNA. 

10 The term "primer," as used herein, is meant to encompass any nucleic acid that is 

capable of priming the synthesis of a nascent nucleic acid in a template-dependent 
process. Typically, primers are oligonucleotides from ten to twenty and/or thirty base 
pairs in length, but longer sequences can be employed. Primers may be provided in 
double-stranded and/or single-stranded form, although the single-stranded form is 

15 preferred. 

Pairs of primers designed to selectively hybridize to nucleic acids corresponding 
to the UGTl gene locus (Genbank accession AF279093 AF297093 ), UGTlAl gene and/or 
SEQ ID N0:1 or variants thereof, and fragments thereof are contacted with the template 
nucleic acid under conditions that permit selective hybridization. SEQ ID N0:1 set forth 

20 a nucleotide sequence that includes a majority of the UGTl A 1 gene. SEQ ID N0:1 
includes nucleotides 169,831 to 187,313 of the UGTl gene locus with nucleotide 1645 of 
SEQ ID N0:1 corresponding to nucleotide -3565 from the transcriptional start of the 
UGTl A I gene, thus the transcriptional start is located at nucleotide 5212 of SEQ ID 
N0:1. Depending upon the desired application, high stringency hybridization conditions 

25 may be selected that will only allow hybridization to sequences that are completely 
complementary to the primers. In other embodiments, hybridization may occur under 
reduced stringency to allow for amplification of nucleic acids that contain one or more 
mismatches with the primer sequences. Once hybridized, the template-primer complex is 
contacted with one or more enzymes that facilitate template-dependent nucleic acid 

30 synthesis. Multiple rounds of amplification, also referred to as "cycles," are conducted 
imtil a sufficient amount of amplification product is produced. 
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The amplification product may be detected, analyzed or quantified. In certain 
applications, the detection may be performed by visual means. In certain applications, 
the detection may involve indirect identification of the product via chemiluminescence, 
radioactive scintigraphy of incorporated radiolabel or fluorescent label or even via a 
5 system using electrical and/or thermal impulse signals (Affymax technology; Bellus, 
1994). 

A number of template dependent processes are available to amplify the 
oligonucleotide sequences present in a given template sample. One of the best known 
amplification methods is the polymerase chain reaction (referred to as PGR'''**) which is 

10 described in detail in U.S. Patents 4,683,195, 4,683,202 and 4,800,159, and in Innis et al, 
1988, each of which is incorporated herein by reference in their entirety. 

Another method for amplification is ligase chain reaction ("LCR"), disclosed in 
European Application No. 320 308, incorporated herein by reference in its entirety. U.S. 
Patent 4,883,750 describes a method similar to LCR for binding probe pairs to a target 

1 5 sequence. A method based on PGR™ and oligonucleotide ligase assay (OLA) (described 
in further detail below), disclosed in U.S. Patent 5,912,148, may also be used. 

Alternative methods for amplification of target nucleic acid sequences that may 
be used in the practice of the present invention are disclosed in U.S. Patents 5,843,650, 
5,846,709, 5,846,783, 5,849,546, 5,849,497, 5,849,547, 5,858,652, 5,866,366, 5,916,776, 

20 5,922,574, 5,928,905, 5,928,906, 5,932,451, 5,935,825, 5,939,291 and 5,942,391, Great 
Britain Application 2 202 328, and in PGT Application PGT/US89/01025, each of which 
is incorporated herein by reference in its entirety. Qbeta Replicase, described in PCX 
Application PCT/US87/00880, may also be used as an amplification method in the 
present invention. 

25 An isothermal amplification method, in which restriction endonucleases and 

ligases are used to achieve the amplification of target molecules that contain nucleotide 
5'-[alpha-thio]-triphosphates in one strand of a restriction site may also be useful in the 
amplification of nucleic acids in the present invention (Walker et al, 1992). Strand 

Displacement Amplification (SDA), disclosed in U.S. Patent 5,916,779, is another 
30 method of carrying out isothermal amplification of nucleic acids which involves multiple 
rounds of strand displacement and synthesis, i.e., nick translation. 
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Other nucleic acid amplification procedures include transcription-based 
amplification systems (TAS), including nucleic acid sequence based amplification 
(NASBA) and 3SR (Kwoh et al, 1989; PCT Application WO 88/10315, incorporated 
herein by reference in their entirety). European Application 329 822 disclose a nucleic 
5 acid amplification process involving cyclically synthesizing single-stranded RNA 
("ssRNA"), ssDNA, and double-stranded DNA (dsDNA), which may be used in 
accordance with the present invention. 

PCT Application WO 89/06700 (incorporated herein by reference in its entirety) 
disclose a nucleic acid sequence amplification scheme based on the hybridization of a 

10 promoter region/primer sequence to a target single-stranded DNA ("ssDNA") followed 
by transcription of many RNA copies of the sequence. This scheme is not cyclic, i.e., 
new templates are not produced from the resultant RNA transcripts. Other amplification 
methods include "RACE" and "one-sided PCR" (Frohman, 1990; Ohara et al, 1989). 
C. Detection of Nucleic Acids 

15 Following any amplification, it may be desirable to separate the amplification 

product from the template and/or the excess primer. In one embodiment, amplification 
products are separated by agarose, agarose-acrylamide or polyacrylamide gel 
electrophoresis using standard methods (Sambrook et al, 2001). Separated amplification 
products may be cut out and eluted from the gel for further manipulation. Using low 

20 melting point agarose gels, the separated band may be removed by heating the gel, 
followed by extraction of the nucleic acid. 

Separation of nucleic acids may also be effected by spin columns and/or 
chromatographic techniques known in art. There are many kinds of chromatography 
which may be used in the practice of the present invention, including adsorption, 

25 partition, ion-exchange, hydroxylapatite, molecular sieve, reverse-phase, column, paper, 
thin-layer, and gas chromatography as well as HPLC. 

In certain embodiments, the amplification products are visualized, with or without 
separation. A typical visualization method involves staining of a gel with ethidium 
bromide and visualization of bands under UV light. Alternatively, if the amplification 

30 products are integrally labeled with radio- or fluorometrically-labeled nucleotides, the 
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separated amplification products can be exposed to x-ray film or visualized under the 
appropriate excitatory spectra. 

In one embodiment, following separation of amplification products, a labeled 
nucleic acid probe is brought into contact with the amplified marker sequence. The probe 

5 preferably is conjugated to a chromophore but may be radiolabeled. In another 
embodiment, the probe is conjugated to a binding partner, such as an antibody or biotin, 
or another binding partner carrying a detectable moiety. 

In particular embodiments, detection is by Southern blotting and hybridization 
with a labeled probe. The techniques involved in Southern blotting are well known to 

10 those of skill in the art (see Sambrook et al. , 2001). One example of the foregoing is 
described in U.S. Patent 5,279,721, incorporated by reference herein, which discloses an 
apparatus and method for the automated electirophoresis and transfer of nucleic acids. 
The apparatus permits electrophoresis and blotting without external manipulation of the 
gel and is ideally suited to carrying out methods according to the present invention. 

15 Other methods of nucleic acid detection that may be used in the practice of the 

instant invention are disclosed in U.S. Patents 5,840,873, 5,843,640, 5,843,651, 
5,846,708, 5,846,717, 5,846,726, 5,846,729, 5,849,487, 5,853,990, 5,853,992, 5,853,993, 
5,856,092, 5,861,244, 5,863,732, 5,863,753, 5,866,331, 5,905,024, 5,910,407, 5,912,124, 
5,912,145, 5,919,630, 5,925,517, 5,928,862, 5,928,869, 5,929,227, 5,932,413 and 

20 5,935,791 , each of which is incorporated herein by reference. 
D. Other Assays 

Other methods for genetic screening may be used within the scope of the present 
invention, for example, to detect mutations in genomic DNA, cDNA and/or RNA 
samples. Methods used to detect point mutations include denaturing gradient gel 

25 electi-ophoresis ("DGGE"), restriction firagment length polymorphism analysis ("RFLP"), 
chemical or enzymatic cleavage methods, direct sequencing of target regions amplified 
by PCRTM (see above), single-stirand conformation polymorphism analysis ("SSCP") and 
other methods well known in the art. 

One method of screening for point mutations is based on RNase cleavage of base 

30 pair mismatches in RNA/DNA or RNA/RNA heteroduplexes. As used herein, the term 
"mismatch" is defined as a region of one or more impaired or mispaired nucleotides in a 
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double-stranded RNA/RNA, RNA/DNA or DNA/DNA molecule. This definition thus 
includes mismatches due to insertion/deletion mutations, as well as smgle or multiple 
base point mutations. 

U.S. Patent 4,946,773 describes an RNase A mismatch cleavage assay that 
5 involves annealing single-stranded DNA or RNA test samples to an RNA probe, and 
subsequent treatment of the nucleic acid duplexes with RNase A. For the detection of 
mismatches, the single-stranded products of the RNase A treatment, electrophoretically 
separated according to size, are compared to similarly treated control duplexes. Samples 
containing smaller fragments (cleavage products) not seen in the control duplex are 

10 scored as positive. 

Other investigators have described the use of RNase I in mismatch assays. The 
use of RNase I for mismatch detection is described in literature from Promega Biotech. 
Promega markets a kit containing RNase I that is reported to cleave three out of four 
known mismatches. Others have described using the MutS protein or other DNA-repair 

1 5 enzymes for detection of single-base mismatches. 

Ahernative methods for detection of deletion, insertion or substitution mutations 
that may be used in the practice of the present invention are disclosed in U.S. Patents 
5,849,483, 5,851,770, 5,866,337, 5,925,525 and 5,928,870, each of which is incorporated 
herein by reference in its entirety. 

20 E. Specific Examples of SNP Screening Methods 

Spontaneous mutations that arise diuing the course of evolution in the genomes of 
organisms are often not immediately transmitted throughout all of the members of the 
species, thereby creating polymorphic alleles that co-exist in the species populations. 
Often polymorphisms are the cause of genetic diseases. Several classes of 

25 polymorphisms have been identified. For example, variable nucleotide type 
polymorphisms (VNTRs), arise fi-om spontaneous tandem duplications of di- or 
trinucleotide repeated motifs of nucleotides. If such variations alter the lengths of DNA 
fragments generated by restriction endonuclease cleavage, the variations are referred to as 
restriction fragment length polymorphisms (RFLPs). RFLPs are been widely used in 

30 human and animal genetic analyses. 
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Another class of polymorphisms are generated by the replacement of a single 
nucleotide. Such single nucleotide polymorphisms (SNPs) rarely resuh in changes in a 
restriction endonuclease site. Thus, SNPs are rarely detectable restriction fragment 
length analysis. SNPs are the most common genetic variations and occur once every 100 
5 to 300 bases and several SNP mutations have been found that affect a single nucleotide in 
a protein-encoding gene in a manner sufficient to actually cause a genetic disease. SNP 
diseases are exemplified by hemophilia, sickle-cell anemia, hereditary hemochromatosis, 
late-onset alzheimer disease etc. 

In context of the present invention, polymorphic mutations that affect the activity 

10 and/or levels of the UGTlAl gene products, which are responsible for the 
glucuronidation of irinotecan and other chemotherapeutic and xenobiotic agents, will be 
determined by a series of screening methods. One set of screening methods is aimed at 
identifying SNPs that affect the inducibility, activity and/or level of the UGTlAl gene 
products in in vitro or in vivo assays. The other set of screening methods will then be 

1 5 performed to screen an individual for the occurrence of the SNPs identified above. To do 
this, a sample (such as blood or other bodily fluid or tissue sample) will be taken from a 
patient for genotype analysis. The presence or absence of SNPs will determine the ability 
of the screened individuals to metabolize irinotecan and other chemotherapeutic agents 
that are metabolized by the UGTlAl gene products. According to methods provided by 

20 the invention, these results will be used to adjust and/or alter the dose of irinotecan or 
other agent administered to an individual in order to reduce drug side effects. 

SNPs can be the result of deletions, point mutations and insertions and in general 
any single base alteration, whatever the cause, can result in a SNP. The greater frequency 
of SNPs means that they can be more readily identified than the other classes of 

25 polymorphisms. The greater uniformity of their distribution permits the identification of 
SNPs "nearer" to a particular trait of interest. The combined effect of these two attributes 
makes SNPs extremely valuable. For example, if a particular trait {e.g., inability to 
efficiently metabolize irinotecan) reflects a mutation at a particular locus, then any 
polymorphism that is linked to the particular locus can be used to predict the probability 

30 that an individual will be exhibit that trait. 
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Several methods have been developed to screen polymorphisms and some 
examples are listed below. The reference of Kwok and Chen (2003) and Kwok (2001) 
provide overviews of some of these methods; both of these references are specifically 
incorporated by reference. 
5 SNPs relating to glucuronidation of chemotherapeutic agents can be characterized 

by the use of any of these methods or suitable modification thereof Such methods 
include the direct or indirect sequencing of the site, the use of restriction enzymes where 
the respective alleles of the site create or destroy a restriction site, the use of allele- 
specific hybridization probes, the use of antibodies that are specific for the proteins 
10 encoded by the different alleles of the polymorphism, or any other biochemical 
interpretation. 

i) DNA Sequencing 

The most commonly used method of characterizing a polymorphism is direct 
DNA sequencing of the genetic locus that flanks and includes the polymorphism. Such 

15 analysis can be accomplished using either the "dideoxy-mediated chain termination 
method," also known as the "Sanger Method" (Sanger, F., et al, 1975) or the "chemical 
degradation method," also known as the "Maxam-Gilbert method" (Maxam, A. M., et al, 
1977). Sequencing in combination with genomic sequence-specific amplification 
technologies, such as the polymerase chain reaction may be utilized to facilitate the 

20 recovery of the desired genes (Mullis, K. et al, 1986; European Patent Application 
50,424; European Patent Application. 84,796, European Patent Application 258,017, 
European Patent Application. 237,362; European Patent Application. 201,184; U.S. 
Patents 4,683,202; 4,582,788; and 4,683,194), all of the above incorporated herein by 
reference. 

25 ii) Exonuclease Resistance 

Other methods that can be employed to determine the identity of a nucleotide 
present at a polymorphic site utilize a specialized exonuclease-resistant nucleotide 
derivative (U.S. Patent. 4,656,127). A primer complementary to an allelic sequence 
immediately 3 '-to the polymorphic site is hybridized to the DNA under investigation. If 
30 the polymorphic site on the DNA contains a nucleotide that is complementary to the 
particular exonucleotide-resistant nucleotide derivative present, then that derivative will 
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be incorporated by a polymerase onto the end of the hybridized primer. Such 
incorporation makes the primer resistant to exonuclease cleavage and thereby permits its 
detection. As the identity of the exonucleotide-resistant derivative is known one can 
determine the specific nucleotide present in the polymorphic site of the DNA. 

iii) Microsequencing Methods 

Several other primer-guided nucleotide incorporation procedures for assaying 
polymorphic sites in DNA have been described (Komher, J. S. et al, 1989; Sokolov, B. 
P., 1990; Syvanen 1990; Kuppuswamy et al, 1991; Prezant et al, 1992; UgozzoU, L. et 
al, 1992; Nyren et al, 1993). These methods rely on the incorporation of labeled 
deoxynucleotides to discriminate between bases at a polymorphic site. As the signal is 
proportional to the number of deoxynucleotides incorporated, polymorphisms that occur 
in runs of the same nucleotide resuh in a signal that is proportional to the length of the 
run (Syvanen et a/., 1990). 

iv) Extension in Solution 

French Patent 2,650,840 and PCX Application WO91/02087 discuss a solution- 
based method for determining the identity of the nucleotide of a polymorphic site. 
According to these methods, a primer complementary to allelic sequences immediately 
3'-to a polymorphic site is used. The identity of the nucleotide of that site is determined 
using labeled dideoxynucleotide derivatives which are incorporated at the end of the 
primer if complementary to the nucleotide of the polymorphic site. 

v) Genetic Bit Analysis or Solid-Phase Extension 

PCX Application W092/15712 describes a method that uses mixtures of labeled 
terminators and a primer that is complementary to the sequence 3' to a polymorphic site. 
The labeled terminator that is incorporated is complementary to the nucleotide present in 
the polymorphic site of the target molecule being evaluated and is thus identified. Here 
the primer or the target molecule is immobilized to a solid phase. 

vi) Oligonucleotide Ligation Assay (OLA) 

This is another solid phase method that uses different methodology (Landegren et 
al, 1988). Two oligonucleotides, capable of hybridizing to abutting sequences of a 
single strand of a target DNA are used. One of these oligonucleotides is biotinylated 
while the other is detectably labeled. If the precise complementary sequence is found in a 
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target molecule, the oligonucleotides will hybridize such that their termini abut, and 
create a ligation substrate. Ligation permits the recovery of the labeled oligonucleotide 
by using avidin. Other nucleic acid detection assays, based on this method, combined 
with PGR have also been described (Nickerson et al, 1990). Here PGR is used to 
5 achieve the exponential amplification of target DNA, which is then detected using the 
OLA. 

vii) Ligase/Polymerase-Mediated Genetic Bit Analysis 

U.S. Patent 5,952,174 describes a method that also involves two primers capable 
of hybridizing to abutting sequences of a target molecule. The hybridized product is 

10 formed on a solid support to which the target is immobilized. Here the hybridization 
occurs such that the primers are separated from one another by a space of a single 
nucleotide. Incubating this hybridized product in the presence of a polymerase, a ligase, 
and a nucleoside triphosphate mixture containing at least one deoxynucleoside 
triphosphate allows the ligation of any pair of abutting hybridized oligonucleotides. 

15 Addition of a ligase results in two events required to generate a signal, extension and 
ligation. This provides a higher specificity and lower "noise" than methods using either 
extension or ligation alone and unlike the polymerase-based assays, this method enhances 
the specificity of the polymerase step by combining it with a second hybridization and a 
ligation step for a signal to be attached to the solid phase. 

20 viii) Other Methods To Detect SNPs 

Several other specific methods for SNP detection and identification are presented 
below and may be used as such or with suitable modifications in conjimction with 
identifying polymorphisms of the UGTlAl genes in the present invention. Several other 
methods are also described on the SNP web site of the NGBI at the website 

25 www.ncbi.nlm.nih.gov/SNP, incorporated herein by reference. 

In a particular embodiment, extended haplotypes may be determined at any given 
locus in a population, which allows one to identify exactly which SNPs will be redimdant 
and which will be essential in association studies. The latter is referred to as 'haplotype 
tag SNPs (htSNPs)', markers that capture the haplotypes of a gene or a region of linkage 

30 disequilibrium. See Johnson et al. (2001) and Ke and Cardon (2003), each of which is 
incorporated herein by reference, for exemplary methods. 
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The VDA-assay utilizes PGR amplification of genomic segments by long PGR 
methods using TaKaRa LA Taq reagents and other standard reaction conditions. The 
long amplification can amplify DNA sizes of about 2,000-12,000 bp. Hybridization of 
products to variant detector array (VDA) can be performed by a Affymetrix High 

5 Throughput Screening Genter and analyzed with computerized software. 

A method called Ghip Assay uses PGR amplification of genomic segments by 
standard or long PGR protocols. Hybridization products are analyzed by VDA, Halushka 
et al, 1999, incorporated herein by reference. SNPs are generally classified as "Certain" 
or "Likely" based on computer analysis of hybridization patterns. By comparison to 

10 alternative detection methods such as nucleotide sequencing, "Certain" SNPs have been 
confirmed 100% of the time; and "Likely" SNPs have been confirmed 73% of the time by 
this method. 

Other methods simply involve PGR amplification following digestion with the 
relevant resti-iction enzyme. Yet others involve sequencing of purified PGR products 

15 from known genomic regions. 

In yet another method, individual exons or overlapping fragments of large exons 
are PCR-amplified. Primers are designed from published or database sequences and 
PGR-amplification of genomic DNA is performed using the following conditions: 200 ng 
DNA template, 0.5|aM each primer, SOh-M each of dGTP, dATP, dTTP and dGTP, 5% 

20 formamide, 1.5mM MgGb, 0.5U of Taq polymerase and 0.1 volume of the Taq buffer. 
Thermal cycling is performed and resulting PGR-products are analyzed by PGR-single 
strand conformation polymorphism (PCR-SSCP) analysis, under a variety of conditions, 
e.g, 5 or 10% polyacrylamide gel with 15% urea, with or without 5% glycerol. 
Electirophoresis is performed overnight. PCR-products that show mobility shifts are 

25 reamplified and sequenced to identify nucleotide variation. 

In a method called GGAP-GAI (DEMIGLAGE), sequence and alignment data 
(fi-om a PHRAP.ace file), quality scores for the sequence base calls (fi-om PHRED quality 
files), distance information (from PHYLIP dnadist and neighbour programs) and base- 
calling data (from PHRED '-d' switch) are loaded into memory. Sequences are aligned 

30 and examined for each vertical chunk ('slice') of the resulting assembly for disagreement. 
Any such slice is considered a candidate SNP (DEMIGLAGE). A number of filters are 
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used by DEMIGLACE to eliminate slices that are not likely to represent true 
polymorphisms. These include filters that: (i) exclude sequences in any given slice from 
SNP consideration where neighboring sequence quality scores drop 40% or more; (ii) 
exclude calls in which peak amplitude is below the fifteenth percentile of all base calls 
5 for that nucleotide type; (iii) disqualify regions of a sequence having a high number of 
disagreements with the consensus from participating in SNP calculations; (iv) removed 
from consideration any base call with an alternative call in which the peak takes up 25% 
or more of the area of the called peak; (v) exclude variations that occur in only one read 
direction. PHRED quality scores were converted into probability-of-error values for each 

10 nucleotide in the slice. Standard Baysian methods are used to calculate the posterior 
probability that there is evidence of nucleotide heterogeneity at a given location. 

In a method called CU-RDF (RESEQ), PGR amplification is performed from 
DNA isolated from blood using specific primers for each SNP, and after typical cleanup 
protocols to remove vmused primers and free nucleotides, direct sequencing using the 

1 5 same or nested primers. 

In a method called DEBNICK (METHOD-B), a comparative analysis of clustered 
EST sequences is performed and confirmed by fluorescent-based DNA sequencing. In a 
related method, called DEBNICK (METHOD-C), comparative analysis of clustered EST 
sequences with phred quality > 20 at the site of the mismatch, average phred quality >= 

20 20 over 5 bases 5'-FLANK and 3' to the SNP, no mismatches in 5 bases 5' and 3' to the 
SNP, at least two occurrences of each allele is performed and confirmed by examining 
traces. 

In a method identified by ERO (RESEQ), new primers sets are designed for 
electronically published STSs and used to amplify DNA from 10 different mouse strains. 

25 The amplification product from each strain is then gel purified and sequenced using a 
standard dideoxy, cycle sequencing technique with 33p_iabeled terminators. All the 
ddATP terminated reactions are then loaded in adjacent lanes of a sequencing gel 
followed by all of the ddOTP reactions and so on. SNPs are identified by visually 
scanning the radiographs. 

30 In another method identified as ERO (RESEQ-HT), new primers sets are designed 

for electronically published murine DNA sequences and used to amplify DNA from 10 
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different mouse strains. The amplification product from each strain is prepared for 
sequencing by treating with Exonuclease I and Shrimp Alkaline Phosphatase. 
Sequencing is performed using ABI Prism Big Dye Terminator Ready Reaction Kit 
(Perkin-Elmer) and sequence samples are run on the 3700 DNA Analyzer (96 Capillary 
5 Sequencer). 

FGU-CBT (SCA2-SNP) identifies a method where the region containing the SNP 
is PGR amplified using the primers SCA2-FP3 and SCA2-RP3. Approximately 100 ng 
of genomic DNA is amplified in a 50 ml reaction volume containing a final concentration 
of 5mM Tris, 25mM KCl, 0.75mM MgCh, 0.05% gelatin, 20pmol of each primer and 

10 0.5U of Taq DNA polymerase. Samples are denatured, annealed and extended and the 
PGR product is purified from a band cut out of the agarose gel using, for example, the 
QIAquick gel extraction kit (Qiagen) and is sequenced using dye terminator chemistry on 
an ABI Prism 377 automated DNA sequencer with the PGR primers. 

In a method identified as JBLAGK (SEQ/RESTRIGT), two independent PGR 

15 reactions are performed with genomic DNA. Products from the first reaction are 
analyzed by sequencing, indicating a xmique Fspl restriction site. The mutation is 
confirmed in the product of the second PGR reaction by digesting with Fsp I. 

In a method described as KWOK(l), SNPs are identified by comparing high 
quality genomic sequence data from four randomly chosen individuals by direct DNA 

20 sequencing of PGR products with dye-terminator chemistry (see Kwok et al, 1996). In a 
related method identified as KWOK (2) SNPs) are identified by comparing high quality 
genomic sequence data from overlapping large-insert clones such as bacterial artificial 
chromosomes (BACs) or PI -based artificial chromosomes (PACs). An STS containing 
this SNP is then developed and the existence of the SNP in various populations is 

25 confirmed by pooled DNA sequencing (see Taillon-Miller et al, 1998). In another 
similar method called KW0K(3), SNPs are identified by comparing high quality genomic 
sequence data from overlapping large-insert clones BAGs or PAGs. The SNPs found by 
this approach represent DNA sequence variations between the two donor chromosomes 
but the allele frequencies in the general population have not yet been determined. In 

30 method KW0K(5), SNPs are identified by comparing high quality genomic sequence 
data from a homozygous DNA sample and one or more pooled DNA samples by direct 

55142555.1 

-34- 



DNA sequencing of PGR products with dye-terminator chemistry. The STSs used are 
developed from sequence data found in publicly available databases. Specifically, these 
STSs are amplified by PGR against a complete hydatidiform mole (GHM) that has been 
shown to be homozygous at all loci and a pool of DNA samples from 80 GEPH parents 
(see Kwok a/., 1994). 

In another such method, KWOK (OverlapSnpDetectionWithPolyBayes), SNPs 
are discovered by automated computer analysis of overlapping regions of large-insert 
human genomic clone sequences. For data acquisition, clone sequences are obtained 
directly from large-scale sequencing centers. This is necessary because base quality 
sequences are not present/available through GenBank. Raw data processing involves 
analyzed of clone sequences and accompanying base quality information for consistency. 
Finished ('base perfect', error rate lower than 1 in 10,000 bp) sequences with no 
associated base quality sequences are assigned a uniform base quality value of 40 (1 in 
10,000 bp error rate). Draft sequences without base quality values are rejected. 
Processed sequences are entered into a local database. A version of each sequence with 
known human repeats masked is also stored. Repeat masking is performed with the 
program "MASKERAID." Overlap detection: Putative overlaps are detected with the 
program "WUBLAST." Several filtering steps followed in order to eliminate false 
overlap detection results, i.e. similarities between a pair of clone sequences that arise due 
to sequence duplication as opposed to true overlap. Total length of overiap, overall 
percent similarity, number of sequence differences between nucleotides with high base 
quality value "high-quality mismatches." Results are also compared to results of 
restriction fragment mapping of genomic clones at Washington University Genome 
Sequencing Genter, finisher's reports on overlaps, and results of the sequence contig 
building effort at the NGBI. SNP detection: Overiapping pairs of clone sequence are 
analyzed for candidate SNP sites with the 'POLYBAYES' SNP detection software. 
Sequence differences between the pair of sequences are scored for the probability of 
representing true sequence variation as opposed to sequencing error. This process 
requires the presence of base quality values for both sequences. High-scoring candidates 
are extracted. The search is restricted to substitution-type single base pair variations. 
Gonfidence score of candidate SNP is computed by the POLYBAYES software. 
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In method identified by KWOK (TaqMan assay), the TaqMan assay is used to 
determine genotypes for 90 random individxials. In method identified by KYUGEN(Ql), 
DNA samples of indicated populations are pooled and analyzed by PLACE-SSCP. Peak 
heights of each allele in the pooled analysis are corrected by those in a heterozygote, and 

5 are subsequently used for calculation of allele frequencies. Allele frequencies higher 
than 10% are reliably quantified by this method. Allele frequency = 0 (zero) means that 
the allele was foimd among individuals, but the corresponding peak is not seen in the 
examination of pool. Allele frequency = 0-0.1 indicates that minor alleles are detected in 
the pool but the peaks are too low to reliably quantify. 

10 In yet another method identified as KYUGEN (Method!), PGR products are post- 

labeled with fluorescent dyes and analyzed by an automated capillary electrophoresis 
system under SSCP conditions (PLACE-SSCP). Four or more individual DNAs are 
analyzed with or without two pooled DNA (Japanese pool and CEPH parents pool) in a 
series of experiments. Alleles are identified by visual inspection. Individual DNAs with 

15 different genotypes are sequenced and SNPs identified. Allele frequencies are estimated 
from peak heights in the pooled samples after correction of signal bias using peak heights 
in heterozygotes. For the PGR primers are tagged to have 5'-ATT or 5'-GTT at their ends 
for post-labeling of both strands. Samples of DNA (10 ng/ul) are amplified in reaction 
mixtures containing the buffer (lOmM Tris-HCl, pH 8.3 or 9.3, 50mM KCl, 2.0mM 

20 MgCh), 0.25 [oM of each primer, 200nM of each dNTP, and 0.025 units/fil of Taq DNA 
polymerase premixed with anti-Taq antibody. The two strands of PGR products are 
differentially labeled with nucleotides modified with RllO and R6G by an exchange 
reaction of Klenow fragment of DNA polymerase I. The reaction is stopped by adding 
EDTA, and unincorporated nucleotides are dephosphorylated by adding calf intestinal 

25 alkaline phosphatase. For the SSCP: an aliquot of fluorescently labeled PGR products 
and TAMRA-labeled internal markers are added to deionized formamide, and denatured. 
Elecfrophoresis is performed in a capillary using an ABI Prism 310 Genetic Analyzer. 
Genescan softwares (P-E Biosystems) are used for data collection and data processing. 
DNA of individuals (two to eleven) including those who showed different genotypes on 

30 SSGP are subjected for direct sequencing using big-dye terminator chemistry, on ABI 
Prism 310 sequencers. Multiple sequence trace files obtained from ABI Prism 310 are 
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processed and aligned by Phred/Phrap and viewed using Consed viewer. SNPs are 
identified by PolyPhred software and visual inspection. 

In yet another method identified as KYUGEN (Method2), individuals with 
different genotypes are searched by denaturing HPLC (DHPLC) or PLACE-SSCP 
5 (Inazuka et al, 1997) and their sequences are determined to identify SNPs. PGR is 
performed with primers tagged v^th 5'-ATT or 5'-GTT at their ends for post-labeling of 
both strands. DHPLC analysis is carried out using the WAVE DNA fragment analysis 
system (Transgenomic). PGR products are injected into DNASep column, and separated 
under the conditions determined using WAVEMaker program (Transgenomic). The two 

10 strands of PGR products that are differentially labeled with nucleotides modified with 
Rl 10 and R6G by an exchange reaction of Klenow fragment of DNA polymerase I. The 
reaction is stopped by adding EDTA, and imincorporated nucleotides are 
dephosphorylated by adding calf intestinal alkaline phosphatase. SSGP followed by 
electrophoresis is performed in a capillary using an ABI Prism 310 Genetic Analyzer. 

15 Genescan softwares (P-E Biosystems). DNA of individuals including those who showed 
different genotypes on DHPLC or SSGP are subjected for direct sequencing using big- 
dye terminator chemistry, on ABI Prism 310 sequencer. Multiple sequence trace files 
obtained from ABI Prism 310 are processed and aligned by Phred/Phrap and viewed 
using Consed viewer. SNPs are identified by PolyPhred software and visual inspection. 

20 Trace chromatogram data of EST sequences in Unigene are processed with PHRED. To 
identify likely SNPs, single base mismatches are reported from multiple sequence 
alignments produced by the programs PHRAP, BRO and PGA for each Unigene cluster. 
BRO corrected possible misreported EST orientations, while POA identified and 
analyzed non-linear alignment structures indicative of gene mixing/chimeras that might 

25 produce spurious SNPs. Bayesian inference is used to weigh evidence for true 
polymorphism versus sequencing error, misalignment or ambiguity, misclustering or 
chimeric EST sequences, assessing data such as raw chromatogram height, sharpness, 
overlap and spacing; sequencing error rates; context-sensitivity; cDNA library origin, etc. 
In method identified as MARSHFIELD(Method-B), overlapping human DNA 

30 sequences which contained putative insertion/deletion polymorphisms are identified 
through searches of public databases. PGR primers which flanked each polymorphic site 
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are selected from the consensus sequences. Primers are used to amplify individual or 
pooled human genomic DNA. Resulting PGR products are resolved on a denaturing 
polyacrylamide gel and a Phosphorlmager is used to estimate allele frequencies from 
DNA pools. 

5 

IV. PHARMACEUTICAL COMPOSITIONS 

Aqueous compositions may have an effective amount of irinotecan and/or an 
effective amount of a compoimd (second agent) that increases conjugative enzyme 
activity, as represented by a compound that increases the activity of the phase II 

10 conjugative enzyme, glucuronosyltransferase or that decreases biliary transport. Such 
compositions will generally be dissolved or dispersed in a pharmaceutically acceptable 
carrier or aqueous medium. 

The phrases "pharmaceutically or pharmacologically acceptable" refer to 
molecular entities and compositions that do not produce an adverse, allergic or other 

15 untoward reaction when administered to an animal, or human, as appropriate. As used 
herein, "pharmaceutically acceptable carrier" includes any and all solvents, dispersion 
media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying 
agents and the like. The use of such media and agents for pharmaceutical active 
substances is well known in the art. Except insofar as any conventional media or agent is 

20 incompatible with the active ingredients, its use in the therapeutic compositions is 
contemplated. Supplementary active ingredients, such as other anti-cancer agents, can 
also be incorporated into the compositions. 

In addition to the compounds formulated for parenteral administration, such as 
intravenous or intramuscular injection, other pharmaceutically acceptable forms include, 

25 e.g., tablets or other solids for oral administration; time release capsules; and any other 
form cxirrently used, including cremes, lotions, mouthwashes, inhalants and the like. 
A. Parenteral Administration 

The active compovmds will often be formulated for parenteral administration, e.g., 
formulated for injection via the intravenous, intramuscular, sub-cutaneous, or even 
30 intraperitoneal routes. The preparation of an aqueous composition that contains 
irinotecan and a second agent as active ingredients will be known to those of skill in the 
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art in light of the present disclosure. Typically, such compositions can be prepared as 
injectables, either as liquid solutions or suspensions; solid forms suitable for using to 
prepare solutions or suspensions upon the addition of a liquid prior to injection can also 
be prepared; and the preparations can also be emulsified. 
5 Solutions of the active compounds as free base or pharmacologically acceptable 

salts can be prepared in water suitably mixed with a surfactant, such as 
hydroxypropylcellulose. Dispersions can also be prepared in glycerol, liquid 
polyethylene glycols, and mixtures thereof and in oils. Under ordinary conditions of 
storage and use, these preparations contain a preservative to prevent the growth of 

10 microorganisms. 

The pharmaceutical forms suitable for injectable use include sterile aqueous 
solutions or dispersions; formulations including sesame oil, peanut oil or aqueous 
propylene glycol; and sterile powders for the extemporaneous preparation of sterile 
injectable solutions or dispersions. In all cases the form must be sterile and must be fluid 

15 to the extent that easy syringability exists. It must be stable under the conditions of 
manufacture and storage and must be preserved against the contaminating action of 
microorganisms, such as bacteria and fungi. 

The active compounds may be formulated into a composition in a neutral or salt 
form. Pharmaceutically acceptable salts, include the acid addition salts (formed with the 

20 free amino groups of the protein) and which are formed with inorganic acids such as, for 
example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, 
tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be 
derived from inorganic bases such as, for example, sodium, potassium, ammonium, 
calcium, or ferric hydroxides, and such organic bases as isopropylamine, frimethylamine, 

25 histidine, procaine and the like. 

The carrier can also be a solvent or dispersion medixrai containing, for example, 
water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid polyethylene 
glycol, and the like), suitable mixtures thereof, and vegetable oils. The proper fluidity 
can be maintained, for example, by the use of a coating, such as lecithin, by the 

30 maintenance of the required particle size in the case of dispersion and by the use of 
surfactants. The prevention of the action of microorganisms can be brought about by 
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various antibacterial ad antifungal agents, for example, parabens, chlorobutanol, phenol, 
sorbic acid, thimerosal, and the like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of the 
injectable compositions can be brought about by the use in the compositions of agents 
5 delaying absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions are prepared by incorporating the active compounds in 
the required amount in the appropriate solvent with various of the other ingredients 
enumerated above, as required, followed by filtered sterilization. Generally, dispersions 
are prepared by incorporating the various sterilized active ingredients into a sterile 

10 vehicle which contains the basic dispersion medium and the required other ingredients 
from those enumerated above. In the case of sterile powders for the preparation of sterile 
injectable solutions, the preferred methods of preparation are vacuvim-drying and freeze- 
drying techniques which yield a powder of the active ingredient plus any additional 
desired ingredient from a previously sterile-filtered solution thereof 

1 5 Upon formulation, solutions will be administered in a manner compatible with the 

dosage formulation and in such amount as is therapeutically effective. The formulations 
are easily administered in a variety of dosage forms, such as the type of injectable 
solutions described above, with even drug release capsules and the like being 
employable. 

20 For parenteral administration in an aqueous solution, for example, the solution 

should be suitably buffered if necessary and the liquid diluent first rendered isotonic with 
sufficient saline or glucose. These particular aqueous solutions are especially suitable for 
intravenous, intramuscular, subcutaneous and intraperitoneal administration. In this 
connection, sterile aqueous media which can be employed will be known to those of skill 

25 in the art in light of the present disclosure. For example, one dosage could be dissolved 
in ImL of isotonic NaCl solution and either added to lOOOmL of hypodermoclysis fluid 
or injected at the proposed site of infiision, (see for example, "Remington's 
Pharmaceutical Sciences" 15th Edition, pages 1035-1038 and 1570-1580). Some 
variation in dosage will necessarily occur depending on the condition of the subject being 

30 treated. The person responsible for administration will, in any event, determine the 
appropriate dose for the individual subject. 
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B. Oral Administration 

In certain embodiments, active compoxmds may be administered orally. This is 
contemplated for agents which are generally resistant, or have been rendered resistant, to 
proteolysis by digestive enzymes. Such compoxmds are contemplated to include all those 
5 compounds, or drugs, that are available in tablet form from the manufacturer and 
derivatives and analogues thereof. 

For oral administration, the active compotmds may be administered, for example, 
with an inert diluent or with an assimilable edible carrier, or they may be enclosed in hard 
or soft shell gelatin capsule, or compressed into tablets, or incorporated directly with the 

10 food of the diet. For oral therapeutic administration, the active compounds may be 
incorporated with excipients and used in the form of ingestible tablets, buccal tables, 
troches, capsules, elixirs, suspensions, syrups, wafers, and the like. Such compositions 
and preparations should contain at least 0.1% of active compound. The percentage of the 
compositions and preparations may, of cotirse, be varied and may conveniently be 

15 between about 2 to about 60% of the weight of the unit. The amount of active 
compounds in such therapeutically useful compositions is such that a suitable dosage will 
be obtained. 

The tablets, troches, pills, capsules and the like may also contain the following: a 
binder, as gum tragacanth, acacia, cornstarch, or gelatin; excipients, such as dicalcium 

20 phosphate; a disintegrating agent, such as com starch, potato starch, alginic acid and the 
like; a lubricant, such as magnesiirai stearate; and a sweetening agent, such as sucrose, 
lactose or saccharin may be added or a flavoring agent, such as peppermint, oil of 
wintergreen, or cherry flavoring. When the dosage unit form is a capsule, it may contain, 
in addition to materials of the above type, a liquid carrier. Various other materials may 

25 be present as coatings or to otherwise modify the physical form of the dosage unit. For 
instance, tablets, pills, or capsules may be coated with shellac, sugar or both. A syrup of 
elixir may contain the active compoimds sucrose as a sweetening agent methyl and 
propylparabens as preservatives, a dye and flavoring, such as cherry or orange flavor. Of 
course, any material used in preparing any dosage unit form should be pharmaceutically 

30 pure and substantially non-toxic in the amounts employed. In addition, the active 
compounds may be incorporated into sustained-release preparation and formulations. 
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Upon formulation, the compounds will be administered in a manner compatible 
with the dosage formulation and in such amovmt as is therapeutically effective. The 
formulations are easily administered in a variety of dosage forms, such as those described 
below in specific examples. 
5 C. Liposomes 

In a particular embodiment, liposomal formulations are contemplated. Liposomal 
encapsulation of pharmaceutical agents prolongs their half-lives when compared to 
conventional drug delivery systems. Because larger quantities can be protectively 
packaged, this allow the opportunity for dose-intensity of agents so delivered to cells. 
10 This would be particularly attractive in the chemotherapy of cervical cancer if there were 
mechanisms to specifically enhance the cellular targeting of such liposomes to these 
cells. 

"Liposome" is a generic term encompassing a variety of single and multilamellar 
lipid vehicles formed by the generation of enclosed lipid bilayers. Phospholipids are used 

15 for preparing the liposomes according to the present invention and can carry a net 
positive charge, a net negative charge or are neutral. Dicetyl phosphate can be employed 
to confer a negative charge on the liposomes, and stearylamine can be used to confer a 
positive charge on the liposomes. Liposomes are characterized by a phospholipid bilayer 
membrane and an inner aqueous medium. Multilamellar liposomes have multiple lipid 

20 layers separated by aqueous medium. They form spontaneously when phospholipids are 
suspended in an excess of aqueous solution. The lipid components imdergo self- 
rearrangement before the formation of closed structures and entrap water and dissolved 
solutes between the lipid bilayers (Ghosh and Bachhawat, 1991). Also contemplated are 
cationic lipid-nucleic acid complexes, such as lipofectamine-nucleic acid complexes 

25 

V. KITS 

Any of the compositions described herein may be comprised in a kit. In a non- 
limiting example, reagents for determining the genotype of one or both UGTlAl genes 
are included in a kit. The kit may further include individual nucleic acids that can be 
30 amplify and/or detect particular nucleic acid sequences the UGTlAl gene. It may also 
include one or more buffers, such as a DNA isolation bufffers, an amplification buffer or 
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a hybridization buffer. The kit may also contain compounds and reagents to prepare 
DNA templates and isolate DNA from a sample. The kit may also include various 
labeling reagents and compounds. 

The components of the kits may be packaged either in aqueous media or in 
5 lyophilized form. The container means of the kits will generally include at least one vial, 
test tube, flask, bottle, syringe or other container means, into which a component may be 
placed, and preferably, suitably aliquoted. Where there are more than one component in 
the kit (labeling reagent and label may be packaged together), the kit also will generally 
contain a second, third or other additional container into which the additional components 

10 may be separately placed. However, various combinations of components may be 
comprised in a vial. The kits of the present invention also will typically include a means 
for containing the nucleic acids, and any other reagent containers in close confinement 
for commercial sale. Such containers may include injection or blow-molded plastic 
containers into which the desired vials are retained. 

15 When the components of the kit are provided in one and/or more liquid solutions, 

the liquid solution is an aqueous solution, with a sterile aqueous solution being 
particularly preferred. However, the components of the kit may be provided as dried 
powder(s). When reagents and/or components are provided as a dry powder, the powder 
can be reconstituted by the addition of a suitable solvent. It is envisioned that the solvent 

20 may also be provided in another container means. 

A kit will also include instructions for employing the kit components as well the 
use of any other reagent not included in the kit. Instructions may include variations that 
can be implemented. 

It is contemplated that such reagents are embodiments of kits of the invention. 
25 Such kits, however, are not limited to the particular items identified above and may 
include any reagent used directly or indirectly in the detection of polymorphisms in the 
UGTlAl gene or the activity level of the UGTl Al polypeptide. 

EXAMPLES 

30 The following examples are included to demonstrate preferred embodiments of 

the invention. It should be appreciated by those of skill in the art that the techniques 
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disclosed in the examples which follow represent techniques discovered by the inventor 
to function well in the practice of the invention, and thus can be considered to constitute 
preferred modes for its practice. However, those of skill in the art should, in light of the 
present disclosure, appreciate that many changes can be made in the specific 
5 embodiments which are disclosed and still obtain a like or similar resvilt without 
departing from the spirit and scope of the invention. 

EXAMPLE 1 

MATERIALS AND METHODS FOR EXAMPLES 2-5 
10 Chemicals and reagents 

Exonuclease I and shrimp alkaline phosphatase (exo/SAP) was purchased from 
USB (Cleveland, Ohio, USA). ABI Big Dye terminator cycle-sequencing kit was 
purchased from Applied Biosystems (Foster City, California, USA). Primers for 
amplification, sequencing of the PBREM, and amplification of the (TA)n polymorphism 

15 were obtained from GibcoBRL (Invitrogen Co., Carlsbad, California, USA). SN-38 was 
kindly provided by Dr Kiyoshi Terada (Yakult Honsha Co., Ltd, Japan). Camptothecin, 
UDPGA, magnesium chloride, trizma base, potassium monohydrogen phosphate and 1- 
heptanesulfonic acid were purchased from Sigma-Aldrich (St. Louis, Missouri, USA). 
Acetonifrile, tefrahydrofiiran and hydrochloric acid were obtained from Fisher Scientific 

20 (Hanover, Illinois, USA). 

Human livers 

Normal human livers (n = 83) were mainly obtained from Liver Tissue 
Procurement and Distribution System (National Institutes of Diabetes and Digestive and 

25 Kidney Diseases, Minneapolis, Mirmesota). DNA was isolated by using Qiagen 
RNA/DNA Maxi Kit (Qiagen Inc., Valencia, California, USA), and microsomes were 
isolated following differential centrifiigation methods (Purba et al, 1987). DNA and 
microsomes were provided by the Liver Core Bank Facility (St. Jude Children's Research 
Hospital) of the Pharmacogenetics of Anticancer Agents Research (PAAR) Group. In 

30 order to identify livers in which enzyme degradation occurred, liver samples consistently 
comprised in the 10th percentile of UGTlAl, UGT1A9 and UGT2B7 activities were 
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sought. UGT1A9 and UGT2B7 activities were measured using specific probes (data not 
shown) (Ramirez et al., 2002 and Innocenti, et al, 2001). Out of eight samples within 
the 10th percentile of UGTlAl, only one sample was comprised within the 10th percentile 
of activities of the other two enzyme activities. If different handling/storage of the liver 

5 or microsomal protein degradation occurred in that sample, this should not have affected 
the degree of phenotype/genotype correlation because the individual had a 7/7 genotype, 
and among the 7/7 genotype samples (n = 1 1), it had the 4th lower value. Moreover, lack 
of correlation between UGTlAl and UGT2B7 activities (n = 83, r = 0.07, P = 0.5) shows 
that differences in tissue integrity and microsome stability have probably a mild influence 

1 0 (if any) on the UGT phenotype. 

The ethnic composition of the 83 liver donors comprised: Caucasians 68%, 
African- Americans 18%, Asians 1%, others 2%. The percentage of samples of unknown 
ethnic origin was 12%. 

1 5 Genotyping of (TA)„ polymorphism 

In order to genotype the (TA)n polymorphism, approximately 40 ng of DNA was 
subjected to amplification by polymerase chain reaction (PGR). The amplification 
primers used have been previously described (Monaghan et al, 1996), where the 
sequence of the forward primer is 5'-GTCACGTGACACAGTCAAAC-3' (SEQ ID 

20 N0:2) and that of the reverse primer is 5'-TTTGCTCCTGCCAGAGGTT-3' (SEQ ID 
N0:3). These primers flank the polymorphic TA locus in the promoter region of the 
UGTlAl gene and amplify a 98bp fragment when a (TA)6 allele is present and a 100 bp 
fragment when a (TA)? allele is present. In the presence of (TA)5 and (TA)8 alleles, 96 bp 
and 102 bp alleles are amplified. The reverse primer is labeled with a fluorescent dye at 

25 its 5'-end to permit visualization of the amplification product. The amplification 
reactions were performed in a 10|j,l volume consisting of 1.5 mmol MgCb, 250 mmol 
dNTPs, 0.8 mmol of each primer and 0.5 U of Taq polymerase (Amplitaq Gold from 
Applied Biosystems). The polymerase was activated at 95°C for 10 min and DNA 
amplified for 35 cycles at 95°C for 30 sec, 55°C for 30 sec and 72°C for 30 sec, followed 

30 by a final extension at 72°C for 1 0 min. Control DNAs from individuals known to have a 
6/6, 6/7 and 7/7 genotype were included in the PGR analysis. PGR fragments were 
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subjected to gel electrophoresis on an ABI 377 DNA analyzer (Applied Biosystems). 
Amplified products were diluted in a formamide and dextran blue loading buffer and 1 j^l 
combined with 1 \x\ of size standard (GS-350 from Applied Biosystems), denatured at 
95°C, and loaded onto a 6% denaturing polyacrylamide gel. Electrophoresis was 
5 performed for 3.5 hours following manufacturers recommendations. The Genescan and 
Genotyper software (version 3.7, Applied Biosystems) was used to analyze fragments for 
size determination. 



Sequencing of PBREM 

10 A 606 bp region (-3641 to -3036) including PBREM was successfully PCR- 

amplified and sequenced in 81 of the 83 human liver DNAs and 22 of the 24 DNA 
samples from African-American individuals (Americans of African descent, bom in the 
USA) included in the NIGMS HGCR Human VariationPanel (Coriell Institute for 
Medical Research, Camden, New Jersey, USA). The reference sequence shown in Fig. 1 

15 is that deposited in the GenBank database (accession number AF3 13454). Amplification 
of the PGR product was performed in a 10 or 25-pl reaction volume using the following 
primers: 5'-CTGGGGATAAACATGGGATG-3' SEQ ID N0:4 (forward) and 5'- 
CACCACCACTTCTGGAACCT-3' SEQ ID N0:5 (reverse). These primers were 
designed using Primer3 software (Rozen et al, 1998). PGR conditions were 2 min at 

20 94°C, 32 or 33 cycles of a three-step cycling program (30 sec at 94°C, 30 sec at 66.8°C 
and 1 min at 72°C) and 72°C for 3 min. Following exo/SAP cleanup of the PGR product, 
this amplicon was then sequenced in the forward and reverse directions using the 
amplification primers. Big Dye terminator chemistry, and run on an ABI 3700 (Applied 
Biosystems) following the manufactvirer's protocol. Sequences were analyzed and 

25 individuals genotyped using the Poly- Phred software (Nickerson et al., 1997). To 
determine the ancestral state of the polymorphisms found in humans, the sequence was 
compared to that of baboon (accession number AC091778). 



SN- 38 glucuronidation assay in human liver microsomes 

30 Samples were phenotyped by using SN-38 as a substrate for UGTIAI. The 

incubation mixture consisted of 5 |amol SN-38, 10 mmol MgGla, 1 mg/ml microsomes. 
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0.025 mol Tris-HCl (pH 7.4) and 5 mmol UDP-GA. Samples were incubated for 30 min 
at 37°C. The reaction was stopped by the addition of methanol. These conditions were 
selected after previous optimization of the enzyme reaction (Iyer et al, 1998). 
Camptothecin (75 ng) was used as an internal standard. SN-38 glucuronidation was 

5 measured by HPLC (Hitachi Instruments Inc., San Jose, California, USA) with 
fluorescence detection (X, excitation = 355 nm, % emission = 515 nm). A nBondapakTw 
Ci8 column (3.9 X 300 mm, 10 \xm; Waters Corp., Milford, Massachusetts, USA) and 
laBondapak™ Cn guardpak (Waters Corp.) were used. A mobile phase of 8/4/88 
acetonitrile/tetra-hydrofuran/0.9 mmol sodium heptanedfonic acid in 50 mmol potassium 

10 dihydrogen phosphate (pH 4) was used during the first 7 min of the run. From 7.1-25 
min, the eluent consisted of 30/70 acetonitrile/5 mmol sodium heptanesulfonic acid in 50 
mmol potassium dihydrogen phosphate (pH 4). The flow rate was 0.9 ml/min. Retention 
times for SN-38G, SN-38 and camptothecin were 13.3, 18.4 and 19.3 min, respectively. 
SN-38 glucuronidation rates were reported as the ratios between SN-38 glucuronide (SN- 

15 38G) and internal standard (IS) peak heights. The intra-assay variability was determined 
by performing 10 incubations on the same day using a pool of hvmian liver microsomes. 
The inter-assay variability was evaluated by incubating a pool of human liver 
microsomes in triplicate on three different days. The inter- and intra- assay variabilities 
were within 7%. 

20 

Statistical analysis 

The significance of linkage disequilibrium between pairs of polymorphic sites 
was assessed using genotypic data and a likelihood ratio test provided in ARLEQUIN, 
version 2 (Schneider et al, 2000). ARLEQUIN was also used to run a modified Markov- 

25 chain random walk algorithm to test for Hardy- Weinberg equilibrium. Next, multisite 
haplotypes were estimated using the program PHASE (Stephens et al, 2001). Because 
this program does not accept both bi- allelic and multi-allelic polymorphic sites, 
haplotypes were estimated only for individuals with either the (TA)6 or (TA)? alleles. 

Thirteen individuals were heterozygous for the (TA)5 or (TA)8 repeat, three of 

30 which were heterozygous only at the TA repeat and therefore unambiguous at the other 
sites. For the remaining 10 individuals, haplotypes were determined manually by 
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assuming that the chromosome with the (TA)6 or (TA)? allele contained a haplotype 
previously identified by the PHASE analysis. In one case, this method would have 
resulted in a new (TA)8 haplotype. However, it is more likely that this individual would 
instead have a novel (TA)6 haplotype (V), which is consistent with the observation that 
5 the (TA)6 allele is found on multiple haplotypes, including other rare ones. An incorrect 
assignment would have little or no affect on the subsequent analyses because the novel 
haplotype only occurs once out of 103 individuals and not in a sample used in studies of 
correlation with phenotype. 

The effective nimiber of haplotypes was calculated as the reciprocal of the sum of 

10 the frequency squared. Diversity in (TA)6 haplotypes in Caucasians and African- 
Americans, based on the numbers and fi-equencies of haplotypes and adjusted by sample 
size, was estimated by DnaSP version 3.53 (Rozas et al.) as well as their SDs. Statistical 
significance was assessed using a t-test as previously described (Nei, 1987). The chi- 
square test was used to analyze the differences in genotype/haplotype firequencies 

1 5 between Caucasians and African- Americans. 

UGTlAl activity was phenotyped by measuring SN-38 glucuronidation rates of 
each liver as the mean ± SD of a single experiment performed in triplicate. The statistical 
analysis of the relationship between the (TA)n polymorphism and phenotype was planned 
to assess first the genotype effect on phenotype in the population sample (n = 83) using 

20 the analysis of variance (ANOVA). If the genotype effect was statistically significant 
then, within each ethnic group, a test of trend across the genotypes was performed using 
the exact Jonkheerer-Terpstra (JT) test (Gibbons et al, 1992). Pairwise comparisons 
between two genotypes were performed using an exact one-sided Wilcoxon test. 
Moreover, trend analysis and pairwise comparisons were performed in genotypes 

25 expressed as the sum of TA repeats in both chromosomes {i.e. in samples with < 12 (5/6, 
6/6, 5/7), 13 (6/7) and < 14 (7/7, 6/8, 7/8) TA repeat genotypes). Concerning the 
haplotype-phenotype relationship, two-sided exact Wilcoxon tests were used to compare 
the SN-38 glucuronidation rates between two haplotypes. SAS system (SAS histitute. 
Inc., Gary, North Carolina) and StatXact-5 (CYTEL Software Corporation, Cambridge, 

30 Massachusetts, USA) were used for statistical analysis. GraphPad software version 3.02 
(GraphPad Software Inc., San Diego, California, USA) was used for graphical analysis. 
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EXAMPLE 2 
GENOTYPING OF (TA)n POLYMORPHISM 
The (TA)6 allele was the most common allele with a frequency of 0.58 while the 
(TA)7 allele had a frequency of 0.36 (Table 1). (TA)5 and (TA)8 alleles were also found, 
although at lower frequencies (0.02 and 0.05, respectively). In the population sample (n 
= 107), the most common genotype was 6/7 (0.41), followed by the 6/6 genotype (0.34). 
Rare genotypes (<0.02) included 5/6, 5/7 and 5/8 genotypes. The (TA)6 and (TA)^ allele 
frequencies were not significantly different between Caucasians and African-Americans 
(chi-square test, P= 0.7). Similarly, 6/6, 6/7, and 7/7 genotype frequencies were not 
different between the two ethnic groups (chi-square test, P = 0.8). One Asian individual 
had a 6/6 genotype, while two individuals with other ethnicities had 6/7 and 7/7 
genotypes. 



15 Table 1: (TA), 



(TA)n 



Population 

Sample 

(n=107) 



polymorphism: genotype frequencies 



5/8 



Caucasians 
(n=56) 



African- 
Americans 
(n=39) 



EXAMPLE 3 

20 SEQUENCING OF PBREM 

In 103 samples, six polymorphisms were found, and two of them (-3279G>T and 
-31560A) are common, with frequencies of 0.39 and 0.30, respectively (FIG. 1, Table 
2). All six polymorphisms are in Hardy-Weinberg equilibrium (P > 0.5). Based upon 
comparisons to the baboon sequence (accession number AC091778, which is 
25 incorporated herein by reference), it is likely that -3279G and -31566 are the ancestral 
states. The most common -3279G>T polymorphism is located in the spacer sequence of 
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the NR3 domain of PBREM (FIG. 1). No variants were found in the gtNRl domain, the 
binding site for constitutive active receptor (CAR). -3279G was significantly more 
common among African- Americans compared to Caucasians (chi squared = 13.82, P = 
0.001) while the frequency of -3156A did not significantly differ between the two ethnic 
5 groups (chi-square test, P = 0.9). 



Table 2 Sequencing of PBREM: genotype frequencies 



Position 


-3440 


-3401 


-3279 


-3177 


-3175 


-3156 


Genotype 


CC 


CA 


TT 


TC 


GG 


GT 


TT 


CC 


CG 


AA 


AG 


GG 


GA 


AA 


Population 

Sample 

(n=103) 


0.96 


0.04 


0.99 


0.01 


0.38 


0.46 


0.16 


0.99 


0.01 


0.99 


0.01 


0.49 


0.43 


0.08 


Caucasian 
(n=55) 


0.96 


0.04 


1 


0 


0.18 


0.58 


0.24 


1 


0 


1 


0 


0.47 


0.44 


0.09 


African- 
Americans 
(n=37) 


0.95 


0.05 


0.97 


0.03 


0.73 


0.24 


0.03 


0.97 


0.03 


0.97 


0.03 


0.51 


0.41 


0.06 



10 EXAMPLE 4 

LINKAGE DISEQUILIBRIUM AND HAPLOTYPE STRUCTURE OF THE 

UGTlAl PROMOTER 
A likelihood ratio test detected significant pairwise linkage disequilibrium 
between sites -3279, -3156 and the (TA)„ polymorphism in our population sample (n = 

15 103, P < 0.0001). When only the common (TA)6 and (TA)7 alleles were used for the 
linkage disequilibrium analysis, the same resuhs were obtained (P < 0.0001). When 
pairwise linkage disequilibrium was separately assessed in Caucasians and African- 
Americans, highly significant linkage disequilibrium was similarly detected in 
Caucasians (P < 0.0001). In African-Americans, pairwise linkage disequilibrium was 

20 also detected between all sites, however, the level of significance varied greatly between 
the pairwise comparisons. Only linkage disequilibrium between (TA)n and -3156 had 
significance levels similar to those seen for Caucasians (P < 0.0005) while linkage 
disequilibrium had only low levels of significance between (TA)n and -3279 (P = 0.02) 
and between -3279 and -3156 (P = 0.04). 

25 Multisite haplotype inference resulted in 10 haplotypes spanning the PBREM 

variants and the (TA)n polymorphism (Table 3). Haplotypes I-V include the (TA)6 allele, 
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and haplotype I differs from haplotype II at position -3279 in the NR3 domain of 
PBREM. Haplotypes VI, VII and VIII include the (TA)^ repeat and haplotypes VI and 
VII differ from each other at position -3156. There is a suggestion that the haplotype 
structure of the (TA)6 allele is different in the African- American subsample. Compared 
5 to Caucasians, haplotype I is less common in African-Americans (chi squared = 27.06, P 
< 0.0001), while haplotype II is more common (chi squared = 14.84, P = 0.0001). 
Differences in haplotype VI and VII frequencies were not statistically significant between 
the two groups (chi- square test, P = 0.44 and 0.48, respectively). 

Among the samples examined, 21 different combinations of these haplotypes 

10 were found. In Caucasians, the most frequent haplotype pairs are I/VI (0.35), I/I (0.24) 
and I/II (0.11), while in African-Americans, they are I/II (0.11), IWI (0.11), II/VIII 
(0.08), I/VI (0.08), II/VII (0.08) and VI/VI (0.08). The effective numbers of haplotypes, 
which reflect how many relatively high frequency haplotypes are observed, were 5.2 and 
2.6 in African-Americans and Caucasians, respectively (Table 3). Finally, diversity (± 

15 SD) of (TA)6 haplotypes was 0.555 ± 0.070 and 0.262 ± 0.065 in African-Americans and 
Caucasians, respectively (P < 0.05). 
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African- 
Americans (n=37) 


0.15 


0.28 


0.03 


0.01 




0.01 
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0.12 


0.04 
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EXAMPLE 5 

UGTlAl PHENOTYPING AND ASSOCIATION WITH (TA)„ POLYMORPHISM 
AND HAPLOTYPES 

UGTlAl activity was measured as SN-38 glucuronidation rates in 83 human liver 
5 microsomes. A 46% coefficient of variation (1.90 f 0.87 SN-38G/IS, mean f SD) and a 
10-fold range in SN-38 glucuronidation were observed. 

Because of the small number of subjects in the 5/7, 5/6, 6/8 and 7/8 genotypes, 
only 6/6, 6/7 and 7/7 were used in the ANOVA analysis. The phenotype was 
significantly different across these three genotypes (P =0.008) (FIG. 2a). The degree of 

10 variation of the SN-38 glucuronidation rate across the genotypes was similar in different 
ethnic groups (P > 0.1). A significantly decreasing trend was shown across the 6/6, 6/7 
and 7/7 genotypes in Caucasians (P < 0.001, JT test, FIG. 2b) and across the 6/6, 6/7, 6/8 
and 7/7 genotypes in African-Americans (P = 0.033, JT test) (FIG. 2c). When samples 
with Asian (n = 1), other (n = 2) and unknown (n = 10) ethnic background were pooled 

15 together, no significant trend could be found across (TA)n genotypes (P > 0.1, JT test) 
(FIG. 2d). In the Caucasian sample, pairwise comparisons of the phenotype between two 
genotype groups showed significant differences between the 6/7 and 7/7 (P= 0.007, one- 
sided exact Wilcoxon test) and 6/6 and 7/7 groups (P = 0.0002). No pair-wise 
comparison was significant within African-Americans, probably due to small number of 

20 samples of each genotype. 

When (TA)n genotypes were regarded as the sum of TA repeat number in both 
chromosomes (i.e. < 12 (5/6, 6/6, 5/7), 13 (6/7) and > 14 (7/7, 6/8, 7/8) genotypes), a 
significant trend of reduced UGTlAl activity (P<0.01) was measured across the three 
groups (the lowest being the > 14 genotype group) in the whole sample population, in 

25 Caucasians, in African-Americans but not in samples with Asian/other/unknown ethnicity 
(P = 0.66). Pairwise comparisons (one-sided exact Wilcoxon test) showed significantly 
reduced UGTlAl activity (P<0.01) in > 14 compared to 13 and < 12 genotypes, and in 13 
compared to < 12 genotypes in the whole sample population and in Caucasians. In 
African- Americans < 12 genotypes had significantly higher UGTlAl activity compared to 

30 either 13 or > 14 genotypes (P = 0.028 and 0.016, respectively), but UGTlAl activity was 
not significantly different between 13 and > 14 genotypes (P = 0.1 1). 
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In samples of Caucasian and African origin, SN-38 glucuronidation rate varies 
significantly across the haplotypes with a decreasing trend (P < 0.0001, JT test) (FIG. 3). 
However, this apparent haplotype-phenotype correlation is likely to be due to the effect 
of the (TA)n polymorphism that is in linkage disequilibrium with the PBREM variants. 
5 Hence, the possible functional effects of the common -32790T and -3 1 56G>A variants 
were investigated by comparing the SN-38 glucuronidation rates across genotypes that 
differed only by the variant being examined. Concerning the -3279G>T variant, SN-38 
glucuronidation was reduced in I/II pairs compared to I/I pairs among Caucasians, 
although without reaching statistical significance (2.06 ± 0.74 versus 2.53 ± 0.82 SN- 
10 38G/IS, respectively) (Wilcoxon rank sum test, P = 0.18). Concerning the -3156G>A 
variant, although SN-38 glucuronidation is slightly reduced in WII compared to WI 
pairs, the difference is not statistically significant (Wilcoxon rank sum test, P = 0.64). 
EXAMPLE 6: 
MATERIALS AND METHODS FOR EXAMPLE 7 
15 Patient Selection 

Patients with histologically confirmed solid tumors or lymphoma known to 
respond to irinotecan or for which no therapy of proven benefit exists were eligible to 
participate in this study. Other eligibility criteria included measurable disease by 
radiologic imaging or physical examination; age of at least 18 years; Karnofsky 
20 performance status of at least 70% (ambulatory and capable of self-care); and adequate 
organ function defined as absolute neutrophil count (ANC) > 1500 i^l"', platelet count > 
100,000 serum creatinine level < 1.5 mg/dl or creatinine clearance > 60 ml/min, AST 
and ALT levels < 5 times the upper limit of normal, and conjugated bilirubin within 
normal limits. Patients must have been off previous anticancer therapy, including 
25 radiation therapy, for at least 4 weeks (6 weeks if the previous treatment included a 
nitrosourea or mitomycin C) and off colony stimulating factor for at least 2 weeks. 
Patients with a history of inflammatory bowel disease requiring therapy, chronic diarrheal 
syndrome, paralytic ileus, or organ or stem cell transplant were excluded from the study. 
Concurrent use of medications that may be substrates of the UGTlAl enzyme or that 
30 may be inducers or inhibitors of UGTlAl activity was not permitted. Pregnant and 
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lactating women were also excluded from participation, and those with reproductive 
potential were required to use an effective contraceptive method if sexually active. 

Treatment Protocol 

Irinotecan was supplied by the National Cancer Institute (NCI) as an intravenous 
5 solution with concentration 20 mg/ml in either 2 ml or 5 ml vials. The amount of 
irinotecan to be administered was removed aseptically from the vial and added to 500 ml 
of 0.9% saline or 5% dextrose injection, USP. Thirty minutes after pretreatment with 20 
mg intravenous ondansetron, irinotecan 350 mg/m^ was administered as a 90 minute 
intravenous infusion once every 3 weeks — a standard dose and schedule. History, 

10 physical examination, complete blood count (CBC) with differential, serum chemistry 
profile (electrolj'tes, blood urea nitrogen, creatinine, glucose, albumin, alkaline 
phosphatase, GGTP, AST, ALT, total and conjugated bilirubin, uric acid, and lactate 
dehydrogenase), and coagulation profile (prothrombin time and partial thromboplastin 
time) were conducted prior to first treatment. Thereafter, history, physical examination, 

15 and toxicity assessment were conducted on day 1 of each cycle unless treatment-related 
toxicity required more frequent follow up. CBC and serum chemistry profile were 
obtained weekly throughout treatment, though CBCs were obtained 3 times per week 
with the appearance of grade 3 or 4 neutropenia or thrombocytopenia. Toxicity 
assessment was done according to the NCI common toxicity criteria, version 2.0 

20 (website: ctep.cancer.gov). Objective tumor assessment by appropriate radiographic 
imaging was performed prior to starting therapy and after every 2 cycles. 

Toxicity Management and Dose Modification 

For patients who experienced diarrhea, abdominal pain, or diaphoresis within 24 
hours of irinotecan administration, 0.25 mg to 1 mg of intravenous atropine was 

25 considered. Delayed diarrhea, defined as diarrhea occurring more than 24 hours after 
irinotecan administration, was treated promptly with loperamide 4 mg at the onset and 
then with 2 mg every 2 hours until the patient was diarrhea-free for at least 12 hours. For 
patients who failed loperamide therapy, diphenoxylate, octreotide, and tincture of opium 
were sequentially added as needed. Patients were instructed to aggressively hydrate 

30 orally and were admitted to the hospital for intravenous electrolyte and fluid replacement 

55142555.1 

-55- 



when necessary. A new course of therapy was not started until the ANC recovered to at 
least 1500 lal'', the platelet count recovered to at least 100,000 iiV\ and treatment-related 
diarrhea fully resolved. Patients with grade 3 or 4 toxicities of any kind were dose- 
reduced by 50 mg/m^ for subsequent cycles. 
Sample Collection 

Prior to the first irinotecan infusion, venous blood (4.5 ml) for genotyping was 
collected in purple top Vacutainer® tubes containing EDTA (Becton, Dickinson, and 
Company, Franklin Lakes, NJ) and stored at -80°C for no more than 5 days prior to 
analysis. Venous blood for pharmacokinetic analysis was collected on day 1 of cycle 1 
for pharmacokinetic analysis. Samples of 7 ml were collected into green top sodium 
heparinized Vacutainer® tubes prior to the infusion; 30, 60, and 90 minutes during the 
infusion; and 10, 20, 30, 45, and 60 minutes and 1.5, 2, 4, 6, 12, and 24 hours after the 
infusion. Samples were centrifuged (2500 rpm, 20 min, 4°C) and the plasma was 
immediately separated, transferred as two aliquots into storage tubes, frozen at -80°C 
until analysis. 

UGTlAl Genotyping Assays 

The variants typed in this study are listed in Table 4. The UGTlAl (TA)nTAA 
polymorphism was genotyped by PCR and product sizing as previously described (Te et 

al, 2000). Alleles with 6 TA repeats resulted in a 98 bp fragment while alleles with 7 
TA repeats resulted in a 100 bp fragment. Alleles with 5 TA and 8 TA repeats resulted in 
96 bp and 102 bp fragments respectively. Alleles with 5, 6, 7, and 8 TA repeats are 
reported as (TA)n and genotypes are assigned based upon the number of TA repeats in 
each allele, i.e., 6/6, 6/7, 7/7, 6/8, et cetera. 

The variants in the 5 'upstream region (-32790T and -3156G>A) and in exon 1 
[21 1G>A (G71R) and 6860A (P229Q)] were genotyped by single base extension (SBE) 
and separated on a denaturing high performance liquid chromatography (DHPLC) system 
(Devaney et al, 2001). Genotyping of the -32790T and -3156G>A variants was 
performed by PCR amplification of a 333 bp fragment in the UGTlAl 5' upstream 
region that contains both variants. The PCR primers used were: 5'-ACC TCT AGT TAC 
ATA ACC TGA A-3' (forward primer; SEQ ID N0:6) and 5'-AAT AAA CCC GAC 
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CTC ACC AC-3' (reverse primer; SEQ ID N0:7). PCRs were performed in a 15 ^il 
volume containing 125 nM each primer, 2.5 mM MgCb, 50 each dNTP and 0.375 U 
of AmpliTaq Gold polymerase (Applied Biosystems) in the buffer provided by the 
manufacturer. PGR cycling conditions were for 40 cycles at 95°G for 15 s, 58°G for 15 s 
5 and 72°C for 30 s in a 9600 thermal cycler (Applied Biosystems). PGR amplified 
products were purified using shrimp alkaline phosphatase and exonuclease I by 
incubating at 37°C for 45 min prior to the SEE reaction. SEE reactions were performed 
in duplex for genotyping of both variants in 10 |il volumes containing 1 |a,M of extension 
primer (5'-GCC AAG GGT AGA GTT GAG T-3' (SEQ ID N0:8) for -3279G>T and 5'- 

10 GAG CCG AGG GGA CCT GTC-3' (SEQ ID N0:9) for -3156G>A), 250 jaM each 
ddNTP and 1.25 U thermosequenase (Amersham Pharmacia Biotech). Reactions were 
cycled at 96°C for 30 s, 55°C for 30 s and 60°C for 30 s for 60 cycles. Separation of the 
SEE products was performed on a WAVE 3500HT DHPLC system (Transgenomic Inc) 
at 70°C after denaturation of the samples. The flow rate used was 1.5 ml/min and the run 

1 5 time for each sample was 2.5 min. The gradient used for elution of the SBE products was 
created by the software based on the length of the extended product and was adjusted 
from 24% to 34% buffer B over 2 min (buffer B contains 25%) acetonitrile). Extended 
products were eluted in the order of C<G<T<A which is dependent on the hydrophobicity 
differences of the four bases. 

20 Genotyping of the 21 1G>A and 686G>A exon 1 variants was performed by PGR 

amplification of a 774 bp fragment that encompasses both variants. The PGR primers 
used were: 5'-ATG CTG GGA AGA TAG TGT TG-3' (forward primer; SEQ ID NO: 10) 
and 5'-TTT GGT GAA GGC AGT TGA TT-3' (reverse primer; SEQ ID NO: 11). PCRs 
were performed in a 15 |.il volume containing 125 nM each primer, 2.5 mM MgCh, 100 

25 [xM each dNTP and 0.375 U of AmpliTaq Gold polymerase (Applied Biosystems) in the 
buffer provided by the manufacturer. PGR cycling conditions were for 40 cycles at 95°G 
for 15 s, 55°C for 15 s and 72°C for 45 s in a 9600 thermal cycler (Applied Biosystems). 
PGR purification was performed as described above and the SBE reactions were 
performed in lOul volumes containing 1 |J,M of each extension primer (5'-GTC TTG 

30 AAG GTG TAA AAT GGT C-3' (SEQ ID NO: 12) for 211G>A or the 5'-GTG GGA 
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CGT GGT TTA TTC CC-3' (SEQ ID NO: 13) for 6860A) using the conditions 
described above. For separation on the DHPLC system, a flow rate of 1.5 ml/min and a 
run time of 3 min was used for each sample. The gradient used for elution of the SBE 
products was created by the WAVE software based on the length of the extended product 
5 and was adjusted from 25.6% to 38. 1% buffer B over 2.5 min. 

Pharmacokinetic Analysis 

Plasma concentrations of irinotecan and its metabolites were determined as 
previously published (Iyer et al, 2001). Pharmacokinetic parameters for irinotecan, SN- 
38, and SN-38G were calculated using standard non-compartmental methods with 
10 WinNonlin 2.0 (Pharsight Corporation, Mountain View, CA). The area imder the plasma 
concentration-time curve (AUG) from time zero to the last measured concentration of 
irinotecan and metabolites was determined by the linear trapezoidal method. The 
glucuronidation ratio was expressed as the ratio of the SN-38G AUG over SN-38 AUG. 

Statistical Analysis 

15 The study was originally designed to prospectively investigate the relationship 

between genetic variation in the UGTl Al promoter and grade 3-4 diarrhea. Results from 
clinical trials using the 350 mg/m2 every 3 weeks schedule suggested a 20 to 35% 
frequency of diarrhea (ref). Based on previously published data, a single-gene Mendelian 
model implied that 16% of patients would have the 7/7 genotype, 48% would have the 

20 6/7 genotype, and 36% would have the 6/6 genotype. A sample size of 60 would have 
had power of 0.8 at a=0.05 to detect a linear trend in the proportion of patients within 
each genotype experiencing grade 3-4 diarrhea defined by 60% of 7/7 patients, 30% of 
6/7 patients, and 10% of 6/6 patients. 

However, due to lower than expected frequency of grade 3-4 diarrhea (see below), 

25 the analyses were instead focused on the frequency of grade 4 neutropenia (ANG < 500 
^il"'). Nonparametric trend tests were used to investigate how the genotype is related to 
pharmacokinetic parameters, pretreatment bilirubin levels and ANG nadir. The 
relationship between genotype and grade 4 neutropenia was assessed by the use of 
Fisher's exact test and calculation of the relative risks. Univariate regression analyses 

30 were performed to identify the potential predictors of ANG nadir. They were performed 
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on the log scale for ANC to reduce skewness in the residuals. The pretreatment variables 
were also considered jointly via analysis of covariance (ANCOVA) models in order to 
identify the pretreatment measurements that can predict ln(ANC nadir). A different 
ANCOVA model simultaneously considering the pre- and post- treatment variables was 
5 used to explore the mechanism through which variability in UGTl Al status might affect 
the ANC nadir. 

EXAMPLE 7: 
Role of-3516G>A of UGTlAl in Irinotecan Toxicity 
Patient Characteristics 

10 Sixty-six patients were enrolled in the study (Table 5). Blood was mistakenly not 

drawn for DNA extraction in one patient and genotype information is available in 65 
patients. Sixty-three patients were assessable for toxicity as 3 patients (one 6/6, one 6/7, 
one 7/8) missed scheduled blood tests and/or physician appointments. Sixty patients are 
assessable for tumor response, as 6 of them were removed from the study before 

15 radiological assessment of tumor response. All the patients received prior chemotherapy 
regimens. Thirty-five of them received additional prior radiotherapy. 

Allele and Genotype Frequencies 

The TA indel allele frequencies were: TA6=0.68, TA7=0.29, TA8=0.02, 
20 TA5=0.01 . The TA5 and TAs alleles occurred exclusively in Black patients (one with 5/6, 
two patients with 6/8, and one patient with 7/8 genotype). -3279T and -3156A alleles 
had a frequency of 0.55 and 0.26, respectively. 

Table 6 shows the frequencies of promoter haplotypes comprising -3279, -3156, 
and the TA indel, based upon our previous publication on their linkage disequilibrium 
25 (Innocent! et al., 2002). The frequency of the haplotype pairs is shown in Table 7. No 
exon 1 variants (21 1G>A and 6860A) were detected in this patient population. 

Toxicity Prevalence, Relative Risk, Genetic Test. 

Toxicity of diarrhea and neutropenia refer to events observed during cycle 1 of 
30 treatment. The frequency of grade 4 neutropenia was 9.5%. Grade 4 neutropenia was 
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much more common in patients with genotype 7/7 (3/6, 50%) compared to patients with 
6/7 genotype (3/24, 12.5%) and 6/6 genotype (0/30, 0%) (p=0.001, Fisher's exact test). 
Nonparametric trend analysis revealed that the TA indel polymorphism is significantly 
conelated to ln(ANC nadir) (7/7<6/7<6/6, z=-2.35, p = 0.02) (FIG. 4). 

5 Because the -3 1 56G>A variant distinguishes between two different haplotypes in 

the TA7 individuals, the relative risk of grade 4 neutropenia was analyzed for the -3156 
AA genotype (versus AG and GG combined) and 7/7 genotype (versus 5/6, 6/6, 6/7 and 
6/8 combined). A higher relative risk was found in patients with -3156 AA genotype 
(14.0, 95% CI 2.1-36.7) compared to patients with 7/7 genotype (9.3, 95% CI 1.7-40.7, 

10 n=63). Moreover, the predictive power of a genetic test in patients receiving irinotecan 
was evaluated for both the TA indel and the -3156 variant (Table 8). The predictive 
power of either 7/7 or -3156 AA genotypes for grade 4 neutropenia was evaluated. In 
addition, the predictive power of either 6/6 or -3156 GG genotypes was evaluated in 
relation to the absence of grade 4 neutropenia (i.e., grade 0-3). In this comparison, the 

15 two 6/8 patients were regarded as either 6/6 or 6/7 genotypes in order to assess whether 
patients with the TAg allele might be a confounding factor for the results of the genetic 
test. 

While this study was originally conceived to examine the relationship between 
UGTlAl genotype and severity of diarrhea, the frequency of grade 3 diarrhea in our 
20 patients was only 5% (n=3), with no instances of grade 4 diarrhea. None of the three 
patients with grade 3 diarrhea were 6/6 (2 6/7 and one 7/7 genotypes). Concerning the 
diarrhea events in patients with the TAg allele (two 6/8 and one 7/8), only a grade 1 event 
was reported in one 6/8 patient. The low frequency of severe diarrhea did not allow any 
formal statistical analysis. 

25 

Total Bilirubin: Correlation with TA Indel Genotype and Toxicity 

Pretreatment total bilirubin levels were obtained in all patients (0.5±0.22 mg/dl, 

mean±SD, n=66). As is shown in FIG. 5, total bilirubin levels were significantly 
correlated with the TA indel polymorphism (nonparametric trend analysis, 7/7>6/7>6/6, z 
30 = 2.88, p < 0.01). Total bilirubin levels were significantly higher in 7/7 patients 
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compared to 6/6 and 6/7 patients combined (0.80±0.29 and 0.48±0.19 mg/dl, 
respectively, p = 0.0003). Concerning the distribution of the -3156 genotypes withing 
each TA indel genotype group, in the 6/7 genotype group, the three patients with GG 
genotype had low bilirubin levels of 0.3-0.4 mg/dl. Similarly, the two patients with 6/8 
5 and GG genotypes had low levels of bilirubin of 0.2-0.3 mg/dl. The one patient with GA 
genotype in the 7/7 group has a bilirubin level of 0.6 mg/dl, which is in the low range for 
this genotype group. The 7/8 patient did not have markedly elevated levels of total 
bilirubin as would be expected if the TAg allele resulted in decreased glucuronidation. 

In addition, the -3156 and the TA indel variants were correlated with total 
10 bilirubin by multiple regression analysis. The AA genotype showed a slightly better 
correlation (r^=0.28, p<0,0001) compared to 7/7 genotype, either when the TAg alleles 
were regarded as TAe (r^=0.23, p=0.002) or TA? (r^=0.20, p=0.0009). The other common 
variant -3279G>T had no significant association with total bilirubin. 

Whether pre-treatment bilirubin would correlate with neutropenia was also 
15 analyzed. Significantly higher bilirubin levels were observed in patients with grade 4 
neutropenia (0.83±0.21 mg/dl) compared to those without grade 4 neutropenia (0.47±0.20 
mg/dl) (p=0.0001) (FIG. 6). No cases of grade 4 neutropenia were reported in patients 
with bilirubin levels less than 0.6 mg/dl. Out of the 7 patients with total bilirubin higher 
then 0.7 mg/dl, 4 of them had grade 4 neutropenia. 
20 Correlation Between TA Indel Genotype and PK Parameters 

Table 9 describes the pharmacokinetic parameters of irinotecan and its 
metabolites stratified by 6/6, 6/7, and 7/7 genotypes. SN-38 AUG increases while 
increasing the number of ^A^ alleles (nonparametric trend analysis, 7/7>6/7>6/6, z = 
2.13, p = 0.03). Conversely, glucuronidation ratios (SN-38G/SN-38 AUC ratios) were 
25 reduced while increasing the number of TA? alleles, (nonparametric trend analysis, 
6/6>6/7>7/7, z=-2.16, p = 0.03). No significant trend was found for irinotecan and SN- 
38G AUCs (p>0.05). 
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Regression Analysis 

The impact of both pharmacokinetic variability and pre-treatment (including 
genotype) variables on variability in neutropenia was also examined. Instead of the TA 
indel genotype, the -3156 variant was used because 1) the -3156 genotype was better 
5 correlated with the risk of grade 4 neutropenia and 2) -3156 better reflected the UGTl Al 
status of patients, based upon the data on the correlation with total bilirubin. Univariate 
regression analyses of ANC nadir selected SN-38 AUG, total bilirubin and -3156 
genotype as the three best independent variables (Table 10). Gender showed a non- 
significant correlation with ANC nadir but it was included in further modeling because of 
10 possible gender differences in glucuronidation. Other variables did not show any 
correlation. 

Multivariate Analyses 

Several multivariate predictive ANCOVA models were considered to identify the 
15 pretreatment measurements that predict ln(ANC nadir). The final model (r^=0.41) was 
selected by backward elimination from Table 10 and is presented Table 11. Pretreatment 
bilirubin level is foimd to be very significant and negatively related to ln(ANC nadir). 
Gender and -3156 genotype are found to be marginally significant after adjusting for the 
total bilirubin level. Ln(ANC nadir) is found to have a lower value in women, and it 
20 decreases with increasing number of (TA)? alleles (6/6>6/7>7/7). Other factors, such as 
ethnicity, number of prior regimens, performance status, and ln(pretreatment ANC) are 
not found to be significant predictors of ln(ANC nadir) after adjusting for -3156 
genotype, gender and total bilirubin. 

After determining the predictive model using pre-treatment variables, the post- 
25 treatment measurements of irinotecan AUG, SN-38 AUG, SN-38G AUG, and 
glucuronidation ratio were added to the model as independent variables with the intention 
of determining the possible mechanism of how the variability in UGTlAl status affects 
ln(ANG nadir). The final model selected through backward elimination (r^=0.5141) 
which best predicts ln(ANG nadir) includes genotype and SN-38 AUG (p<0.001) (Table 
30 12). 
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Toxic Death and Response 

One toxic death was reported, as the patient died of neutropenia-related sepsis. 
He was admitted to the hospital on day 7 of cycle 1 with fever and an no neutrophils 
detected (white blood cell count of 100 |j,r'). He was empirically treated with 
5 ceftazadime, tobramycin, and fluconazole, though no infectious source was ever 
identified. Despite support with granulocyte colony stimulating factor, the patient 
remained neutropenic, became septic, and died on day 1 1 . He had 7/7 genotype and the 
highest level of pretreatment total bilirubin observed in these patients (1 .2 mg/dl). 

Concerning the response rates in this trial, three objective responses were 
10 observed. Two patients achieved a partial response (one with colorectal and the other 
with head and neck cancer) and had a 6/7 genotype. One colorectal cancer patient 
achieved a complete response and had a 6/6 genotype. 



Table 4 

1 5 UGTlAl variants typed in this study. Positions indicated are from the first base of the 



UGTlAl start site in the UGTIA cluster reference sequence (AF297093). 



Nucleotide 
change 


Amino acid 
change 


Exon 


-31560A 




Promoter 


-32790T 




Promoter 


TA indel 




Promoter 


2110A 


G71R 


1 


6860A 


P229Q 


1 



Table 5 
Patient characteristics 

No. of patients 



Patients 

Entered 66 

Assessable for toxicity 63 

Assessable for response 60 
Sex 

Male 39 

Female 27 
Age, median (range) 60 (34-85) 
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Ethnicity 

White 50 

Black 10 

Hispanic 4 

Pacific Islander 1 

Asian 1 
Performance Status 

100% 18 

90% 31 

80% 10 

70% 17 
Tumor type 

Colorectal 10 

Gastroesophageal 14 

Head and Neck 5 

Liver 2 

Lung 19 

Pancreas 3 

Unknown Primary 4 

Others 9 

Prior Radiotherapy 35 
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Table 6 



Frequency of UGTlAl promoter haplotypes. 



-32790T 


-3156G>A 


TA indel 


Frequency 


T 


G 


6 


0.55 


G 


G 


6 


0.13 


G 


A 


7 


0.25 


G 


G 


7 


0.03 


G 


G 


8 


0.02 


G 


G 


5 


0.01 



Table 7 
Frequency of haplotype pairs 



Haplotype pairs 


Frequency 


TG6/TG6 


0.28 


TG6/GA7 


0.28 


TG6/GG6 


0.18 


GA7/GA7 


0.08 


GG6/TA7 


0.06 


TG6/GG7 


0.05 


TG6/GG8 


0.02 


GG6/GG8 


0.02 


GG5/TG6 


0.02 


GA7/GG8 


0.02 



The haplotypes reflect the change of -3279, -3156, and the TA indel variants, such as the 
first base refers to -3279 variants, the second to -3 1 56 variant and the number refers to 
the number of TA repeats. 

Table 8 



Genetic tests for the TA indel and -3156 genotypes 





Sensitivity 


Specificity 


PPV 


NPV 


7/7, grade 4 


0.50(0.19-0.81) 


0.95 (0.85-0.98) 


0.50(0.19-0.81) 


0.95 (0.85-0.98) 


-3156 AA, grade 4 


0.50(0.19-0.81) 


0.96 (0.88-0.99) 


0.60 (0.23-0.92) 


0.95 (0.86-0.98) 


6/6, grade 0-3, 6/8=6/6 


0.57 (0.44-0.69) 


1.00(0.61-1.00) 


1.00 (0.89-1.00) 


0.20 (0.10-0.37) 


6/6, grade 0-3, 6/8=6/7 


0.54(0.41-0.66) 


1.00 (0.61-1.00) 


1.00 (0.89-1.00) 


0.19(0.09-0.35) 


-3156 GG, grade 0-3 


0.63 (0.49-0.74) 


1.00(0.61-1.00) 


1.00(0.90-1.00) 


0.22 (0.11-0.41) 



PPV, positive predictive value. 
NPV, negative predicted value. 



Data are shown with 95% CI in parenthesis. The patient with 5/6 genotype was regarded 
as having a 6/6 genotype. 
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Table 9 

Pharmacokinetic parameters and by 6/6, 6/7, and 7/7 TA indel genotypes 
Irinotecan SN-38" SN-38G Glucuronidation Ratio'' 
TA indel No. of AUC AUG AUG (SN-38G AUG/SN-38 AUG) 
genotype patients (ng*h/nil) (ng*h/ml)(ng*h/ml) 



6/6 30 24412.8 335.9 1954.2 6.52 

(7691.6) (167.7) (1361.1) (3.98) 

6/7 25 26085.5 458.4 1887.9 5.55 

(10814.2) (379.8) (1682.5) (4.79) 

7/7 6 25432.9 542.0 1819.1 3.59 

(6694.9) (195.3) (1249.8) (2.81) 

%l6<6n<in, TrO-U, p=0.03, non-parametric trend analysis. 
5 ''6/6>6/7>7/7, z=-2. 1 6, p=0.03, non-parametric trend analysis. 



Data expressed as mean (standard deviation). 



Table 10 

Univariate analysis of ln(ANC nadir) 



Independent Variable 



SN-38 AUC 0.3523 <0.0001 

Pre-treatment total bilirubin 0.2979 <0.0001 

-3156 genotype 0.2413 0.0003 

T . , .Tin 0.1273 0.0041 

Irmotecan AUC 

Glucuronidation ratio 0.1171 0.0060 

Gender 0.0445 0.0971 

SN-38GAUC 0.0411 0.1109 

Age > 70 0.0242 0.2231 

0.0128 0.3764 

0.0000 0.9749 



White ethnicity 
Ln(pre-treatment ANC) 
Performance status 0.0016 0.9923 
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Table 11 

ANCOVA for the final predictive model of ln(ANC nadir) using pre-treatment 
variables 

5 





Coefficient 


SE 


p-value 


Intercept 


8.1885 


0.2767 


<0.001 


Genotype 








AA vs. GG+GA 


-0.9401 


0.3986 


0.022 


Gender 








Males vs. Females 


0.4323 


0.2001 


0.035 


Total Bilirubin 


-1.8452 


0.4816 


<0.001 



SE, standard error 

The overall model shows an revalue of 0.4048 (p<0.0001). 



Table 12 

ANCOVA for the final predictive model of ln(ANC nadir) using pre-treatment and 
10 post-treatment variables 

Coefficient SE p-value 

Intercept 8.3111 0.1517 <0.001 
Genotype 

AA vs. GG+GA -1.3798 0.3234 <0.001 

SN-38AUC -0.0019 0.0003 <0.001 
SE, standard error 

The overall model shows an revalue of 0.5128 (p<0.0001). 



15 

All of the compositions and/or methods disclosed and claimed herein can be made 
and executed without undue experimentation in light of the present disclosure. While the 
compositions and methods of this invention have been described in terms of preferred 
embodiments, it will be apparent to those of skill in the art that variations may be applied 
20 to the compositions and/or methods and in the steps or in the sequence of steps of the 
method described herein without departing from the concept, spirit and scope of the 
invention. More specifically, it will be apparent that certain agents that are both 
chemically and physiologically related may be substituted for the agents described herein 
while the same or similar results would be achieved. All such similar substitutes and 
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modifications apparent to those skilled in the art are deemed to be within the spirit, scope 
and concept of the invention as defined by the appended claims. 
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